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USE OF HEREGULIN AS AN EPITHELIAL CELL GROWTH FACTOR 

BACKGROUND OF THE INVENTION 
5 Field of the Invention 

This invention relates to the use of HER2, HERS and/or HER4 ligands, in particular heregulin 
polypeptides, as epithelial cell growth factors. 

Description of Background and Related Art 
The HER (ErbB) family belongs to the subclass I receptor tyrosine kinase superfamily and consists 
10 of three distinct receptors, HER2, HER3, and HER4. A ligand for this ErbB family is the protein heregulin 
(HRG), a muitidomain containing protein with at least 15 distinct isoforms. 

Transduction of signals that regulate cell growth and differentiation is regulated in part by 
phosphorylation of various cellular proteins. Protein tyrosine kinases are enzymes that catalyze this process. 
Receptor protein tyrosine kinases are believed to direct cellular growth via ligand-stimulated tyrosine 
15 phosphorylation of intracellular substrates. Growth factor receptor protein tyrosine kinases of the class I 
subfamily include the 170 kDa epidermal grovrth factor receptor (EGFR) encoded by the erb^\ gene. erb^X 
has been causally implicated in human malignancy. In particular, increased expression of this gene has been 
observed in more aggressive carcinomas of the breast, bladder, lung and stomach. 

The second member of the class I subfamily, pi 85"^" was originally identified as the product of the 
20 transfomning gene from neuroblastomas of chemically treated rats. The neu gene (also called er*B2 and 
HER2) encodes a 185 kDa receptor protein tyrosine kinase. Amplification and/or overexpression of the 
human HER2 gene correlates with a poor prognosis in breast and ovarian cancers (Slamon et aL^ Science 
235:177-182 (1987); and Slamon et al.^ Science 244:707-712 (1989)). Overexpression of HER2 has been 
correlated with other carcinomas including carcinomas of the stomach, endometrium, salivary gland, lung, 
25 kidney, colon and bladder. Accordingly, Slamon et ai in US Pat No. 4,968,603 describe and claim various 
diagnostic assays for determining HER2 gene amplification or expression in tumor cells. Slamon et aL 
discovered that the presence of muhiple gene copies of HER2 oncogene in tumor cells indicates that the 
disease is more likely to spread beyond the primary tumor site, and that the disease may therefore require 
more aggressive treatment than might otherwise be indicated by other diagnostic factors. Slamon et aL 
30 conclude that the HER2 gene amplification test, together with the detennmation of lymph node status, 
provides greatly improved prognostic utility. 

A further related gene, called erbB3 or HERS, has also been described. See US Pat No. 5,183,884; 
Kraus et aL, Proc, NatL Acad ScL USA 86:9193-9197 (1989); EP Pat Appln No 444,961A1; and Kraus et 
aL, Proc, NatL Acad, ScL USA 90:2900-2904 (1993). Kraus etaL (1989) discovered that markedly elevated 
35 levels of er^B3 mRNA were present in certain human mammary tumor cell lines indicating that erbB3, like 
erbBl and erbB2, may play a role in human malignancies. Also, Kraus et aL (1993) showed that EGF- 
dependent activation of the ErbB3 catalytic domain of a chimeric EGFR/ErbB3 receptor resulted in a 
proliferative response in transfected N1H-3T3 cells. This is now believed to be the result of endogenous 
ErbBl or ErbB2 in NIH-3T3. Furthermore, these researchers demonstrated that some human mammary 
,40 tumor cell lines display a significant elevation of steady-state ErbB3 tyrosine phosphorylation further 
indicating that this receptor may play a role in human malignancies. The role oferbB3 in cancer has been 
explored by others. It has been found to be overexpressed in breast (Lemoine et aL, Br, J, Cancer 66:1 1 16- 
1121 (1992)), gastrointestinal (Poller et aL, J, PathoL 168:275-280 (1992), Rajkumer et aL, J. PathoL 
170:271-278 (1993), and Sanidas et aL, Int J. Cancer 54:935-940 (1993)), and pancreatic cancers (Lemoine 
45 etaL, J. PathoL 168:269-273 (1992), and Friess a/.. Clinical Cancer Research 1:1413-1420(1995)). 

The class I subfamily of growth factor receptor protein tyrosine kinases has been further extended to 

1 
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do not have the complete glycosylated spacer region between the Ig-Iike domain and EGF-like domain. 
Only one of the GGFs, GGFII, possessed a N-temiinal signal peptide. See also WO 92/18627; WO 
94/00 140; WO 94/04560; WO 94/26298; and WO 95/32724 which refer to GGFs and uses thereof 

Ho et aL in J. BioL Chem. 270(4): 14523-14532 (1995) describe another member of the heregulin 
5 family called sensory and motor neuron-derived factor (SMDF). This protein has an EGF-like domain 
characteristic of all other heregulin polypeptides but a distinct N-terminal domain. The major structural 
difference between SMDF and the other heregulin polypeptides is the lack in SMDF of the Ig-like domain 
and the "glyco" spacer characteristic of all the other heregulin polypeptides. Another feature of SMDF is the 
presence of two stretches of hydrophobic amino acids near the N-terminus. 
10 While the heregulin polypeptides were first identified based on their ability to activate the HER2 

receptor (see Holmes et al.^ supra\ it was discovered that certain ovarian cells expressing neu and neu- 
transfected fibroblasts did not bind or crosslink to NDF, nor did they respond to NDF to undergo tyrosine 
phosphorylation (Peles et aL, EMBO J, 12:961-971 (1993)). This indicated another cellular component was 

I'JK 

necessary for conferring full heregulin responsiveness. Carraway et aL subsequently demonstrated that I- 

15 rHRGpi J77.244 bound to NIH-3T3 fibroblasts stably transfected with bovine erbBZ but not to non- 
transfecied parental cells. Accordingly, they conclude that ErbB3 is a receptor for HRG and mediates 
phosphorylation of intrinsic tyrosine residues as well as phosphorylation of ErbB2 receptor in cells which 
express both receptors, Carraway et aL, 7. BioL Chem, 269(1 9); 14303- 14306 (1994). Sliwkowski et aL, J. 
BioL Chem. 269(20): 14661-14665 (1994) found that cells transfected with HER3 alone show low affmities 

20 for heregulin, whereas cells transfected with both HER2 and HER3 show higher affinities. 

This observation correlates with the "receptor cross-talking" described previously by Kokai et aL, 
Cell 58:287-292 (1989); Stem et aL, EMBO J, 7:995-1001 (1988); and King et aL, 4:13-18 (1989). These 
researchers found that binding of EGF to the EGFR resulted in activation of the EGFR kinase domain and 
cross-phosphorylation of pi 85"^*^. This is believed to be a result of ligand-induced receptor 

25 heterodimerization and the concomitant cross-phosphorylation of the receptors within the heterodimer (Wada 
et aL ,Cell6\:\ 339- 1 347 ( 1 990)). 

HER4 HER2 

Plowman and his colleagues have similarly studied pi 85 /pi 85 activation. They expressed 
HER2 HER4 

pi 85 alone, pi 85 alone, or the two receptors together in human T lymphocytes and demonstrated 

that heregulin is capable of stimulating tyrosine phosphorylation of pi 85 , but could only stimulate 
HER2 

30 pl85 phosphorylation in cells expressing both receptors. Plowman et aL^ Nature 336:473-475 (1993). 

The biological role of heregulin has been investigated by several groups. For example. Falls et aL, 
V (discussed above) foimd that ARIA plays a role in myotube differentiation, namely affecting the synthesis 
and concentration of neurotransmitter receptors in the postsynaptic muscle cells of motor netu*ons. Corfas 
and Fischbach demonstrated that ARIA also increases the number of sodiimi channels in chick muscle. 
35 Corfas and Fischbach, J. Neuroscience^ 13(5): 2118-2125 (1993). It has also been shown that GGFII is 
mitogenic for subconfluent quiescent human myoblasts and that differentiation of clonal himtan myoblasts in 
the continuous presence of GGFII results in greater nimibers of myotubes after six days of differentiation 
(Sklar et al„ J. Cell Biochem,, Abst W462, 18D, 540 (1994)). See also WO 94/26298 published November 
24, 1994. 

40 Holmes et aL, supra, found that HRG exerted a mitogenic effect on mammary cell lines (such as 

SK-BR-3 and MCF-7). The mitogenic activity of GGFs on Schwann cells has also been reported. See, e.g., 
Brockes et al., J. BioL Chem, 255(1 8):8374-8377 (1980); Lemke and Brockes, J. Neurosci, 4:7 5'S3 (1984 
Lemke and Brockes, J. NeuroscL 4:75-83 (1984); Brockes et aL, Arm, NeuroL 20(3):3 17-322 (1986); 
BvQzkcSy},, Methods in Enzym., 147: 217-225 (1987) and Marchionni era/., 5t^ra. Schwann cells constitute 
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87:6378-6382 (1990). However, early efforts to select high affinity fusion phage failed, presumably due to 
the polyvaience of the phage particles. This problem was solved with the development of a "monovalent" 
phage display system in which the fusion protein is expressed at a low level from a phagemid and a helper 
phage provides a large excess of wild-type coat protein. Bass e/ ai. Proteins 8:309-3 14 (1990); Lowman et 
5 aL, Biochem, 30:10832-10838 (1991). Monovalent phage display can be used to generate and screen a large 
number of variant polypeptides to isolate those that bind with high affinity to a target of interest. 

Approximately 50,000 infants are bom in the United States every year with birth weights, less than 
1.5 kg. About two thirds of these very low birth weight infants have evidence of pulmonary immaturity 
manifested as respiratory distress shortly after birth. The majority of these infants require mechanical 

10 ventilation. Respiratory distress syndrome, caused by insufficient pulmonary surfactant production, as well 
as structural immaturity of the lung, is responsible for respiratory difficuhies observed in these prematurely 
bom neonates. Well developed alveoli are necessary to provide efficient oxygen transfer from the air-liquid 
interface of the lung to the systemic circulation. Surfactant proteins are critical in reducing the alveolar 
surface tension at low lung volumes and preventing alveolar collapse. 

15 A need continues to exist for a method of treatment for respiratory distress syndrome and other 

diseases associated with immature lung development and low lung surfactant production. 

SUMMARY OF THE INVENTION 
In general an object of the invention is to provide a method of inducing epithelial cell growth and 
development for the purpose of promoting repair and healing of tissue damage or injury. 

20 Accordingly, one object of this invention is to provide a method of treating respiratory distress 

syndrome in patients, primarily human patients, in need of such treatment. A further object is to provide a 
method of inducing lung epithelial cell growth and development. A fiirther object is to provide a method of 
increasing lung surfactant protein A production in the lung of persons with impaired oxygen transfer in the 
lung alveoli. This invention is useful in treating infants/neonates with respiratory distress as well as youth 

25 and adults with poor lung function due to lung injury or damage. 

In one aspect of this invention, it has now been discovered that these objects and the broader 
objective of treating conditions associated with epithelial cell damage and injury are achieved by 
administering to a patient in need of such treatment an effective amount of a heregulin ligand, preferably a 
polypeptide or fragment thereof These heregulin (HRG) polypeptides, include HRG-a, HRG-Pl, HRG-P2, 

30 HRG-P3 and other HRG polypeptides which cross-react with antibodies directed against these family 
members and/or which are substantially homologous as defined below and includes HRG variants such as N- 
temiinal and C-terminal fragments thereof. A preferred HRG is the ligand disclosed in Fig. lA - ID and 
fimher designated HRG-a. Other preferred HRGs are the ligands disclosed in Figure 2A - 2E, and 
designated HRG-pi; disclosed in Figure 3A - 3E designated HRG-P2; and disclosed in Figure 4 A - 4C 

35 designated HRG-p3 . 

In another aspect, the invention provides a method in which HRG agonist antibodies are 
administered to achieve the objects of the invention. In this embodiment, HER2/HER3 or fragments thereof 
(which also may be synthesized by in vitro methods) are fused (by recombinant expression or an in vitro 
peptidyl bond) to an immunogenic polypeptide and this fusion polypeptide, in turn, is used to raise 

40 antibodies against a HER2/HER3 epitope. Agonist antibodies are recovered from the serum of immunized 
animals. Alternatively, monoclonal antibodies are prepared from in vitro cells or in vivo immunized animals 
in conventional fashion. If desired, the agonist antibodies may be obtamed by phage display selection from a 
phage library of antibodies or antibody fragments. Preferred antibodies identified by routine screening will 
bind to the receptor, but will not substantially cross-react widi any other known ligands such as EGF, and 

45 will activate the HER receptors HER2, HERS and/or HER4. In addition, antibodies may be selected that are 
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ID N0:12) of Y-HRG obtained as described in U.S. Serial No. 08/891.845. The hydrophobic regi n is 
underiined. The EGF-like domain is shaded, cysteine residues in the EGF-like domain are circled. N-linked 
glycosylation sites are marked above the nucleic acid sequence with a ( ). 

Figure 8 shows the cDNA sequence (SEQ ID NO: 13) and amin_ acid sequence (SEQ ID NO: 14) of 
5 SMDF obtained as described in U.S. Serial No. 08/339,517. An EGF-like domain and the apolar and 
uncharged domains (i.e, "apolar 1" consisting of residues from about 48-62 and "apolar IP consisting of 
residues from about 76-100) are underlined. Cysteines in the EGF-like domain and in the "cysteine knot" in 
the unique N-terminal domain ("NTD-cys knot") are boxed. The stop codon is denoted by the letter "O". 
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

JO HRG ligands, in particular polypeptides and agonist antibodies thereof, have affinity for and 

stimulate the HER2, HER3 and/or HER4 receptors or combinations thereof in autophosphorylation. 
Included within the definition of HRG ligands, in addition to HRG-a, HRG-Pl, HRG-P2, HRG-p3 and 
HRG-P2-like, are other polypeptides binding to the HER2, HER3 and/or HER4 receptor, which bear 
substantial amino acid sequence homology to HRG-a or HRG-Pl. Such additional polypeptides fall within 

15 the definition of HRG as a family of polypeptide ligands that bind to the HER2, HER3 and/or HER4 
receptors. 

Hereguiin polypeptides bind with varying afTinhies to the HER2, HER3 and/or HER4 receptors. 
Generally, the HER3 and HER4 receptors are bound with high affmity. It is also known that 
heterodimerization of HER2 with HER3 and of HER2 with HER4 occurs with subsequent receptor cross- 
20 phosphorylation as described above. In the present invention, epithelial cell growth and/or proliferation is 
induced when a hereguiin protein interacts and binds with an individual receptor molecule or a receptor 
dimer such that receptor phosphorylation is induced. Binding and activation of HER2, HER3, HER4 or 
combinations thereof, therefore, is meant to include activation of any form of the receptor necessary for 
receptor activation and biologic function including monomeric receptor and dimeric receptor forms. Dimeric 
25 receptor forms may be referred to below, for example, as HER2/HER3, HER2/HER4, and HER3/HER4. 
I. DEFINITIONS 

In general, the follov/ing words or phrases have the indicated definition when used in the 
description, examples, and claims. 

"Hereguiin" (HRG) iigand is defined herein to be any isolated ligand, preferably a polypeptide 
30 sequence which possesses a biological property of a naturally occurring HRG f>olypeptide. Ligands within 
- the scope of this invention include the NDF, ARIA and GGF grovrth factor hereguiin proteins identified 
above as well as the SMDF and HRG polypeptides discussed in detail herein. These isolated NDF, ARIA 
and GGF hereguiin polypeptides are well known in the art. HRG includes the polypeptides shown in Figs. 
lA-ID, 2A-2E, 3A-3E, 4A-4C, 5A-5D, 6A-6C, 7A-7C and 8 and mammalian analogues thereof. Included 
35 are HRG variants such as the y-HRG described in WO 98/02541, published 22 January 1998; the variants 
described in WO 98/35036, published 13 August 1998; and the SMDF variants described in US Patent NO. 
. 5,770,567 granted 23 June 1998 (WO 96/15244). These applications are incorporated herein in their 
entirety. These variants can be prepared by the methods described below, optionally together with alanine 
scanning and phage display techniques known in the art. Cunningham and Wells, Science 244:1081-85 
40 (1989); Bass et al.. Proteins 8:309-14 (1990); Lowman et al., Biochem. 30:10832-38 (1991). 

The term a "normal" epithelial cell means an epithelial cell which is not transfonned, i.e., is non- 
cancerous and/or non-immortalized. Further, the nonnal epithelial cell is preferably not aneupl id. 
Aneuploidy exists when the nucleus of a cell does not contain an exact multiple f the haploid number of 
chromosomes, one or more chrom somes being present in greater or lesser number than the rest. Typical 
45 properties of transformed cells which fall outside the scope of this invention include the ability to form 
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in its native conformation generally is HRG as found in nature which has not been denatured by chaotropic 
agents, heat or other treatment that substantially modifies the three dimensional structure of HRG as 
detemiined, for example, by migration on nonreducing, nondenaturing sizing gels. Antibody used in this 
determination may be rabbit polyclonal antibody raised by formulating native HRG from a non-rabbit 
5 species in Freund*s complete adjuvant, subcutaneously injecting the formulation, and boosting the immune 
response by intraperitoneal injection of the formulation until the titer of anti-HRG antibody plateaus. 

Ordinarily, biologically or antigenically active HRG will have an amino acid sequence having at 
least 75% amino acid sequence identity with a given HRG sequence, more preferably at least 80%, even 
more preferably at least 90%, and most preferably at least 95%. Identity or homology with respect to an 

10 HRG sequence is defined herein as the percentage of amino acid residues in the candidate sequence that are 
identical with HRG residues in the HRG of Fig. 6A-6C, after aligning the sequences and introducing gaps, if 
necessary, to achieve the maximum percent homology, and not considering any conservative substitutions as 
part of the sequence identity. None of N-terminal, C-terminal or internal extensions, deletions, or insertions 
into HRG sequence shall be construed as affecting homology. 

15 Thus, the biologically active and antigenically active HRG polypeptides that are the subject of this 

invention include each entire HRG sequence; fragments thereof having a consecutive sequence of at least 5, 
10, 15, 20, 25, 30 or 40 amino acid residues from HRG sequence; amino acid sequence variants of HRG 
sequence wherein an amino acid residue has been inserted N- or C-terminal to, or within, HRG sequence or 
its fragment as defined above; amino acid sequence variants of HRG sequence or its fragment as defined 

20 above has been substituted by another residue. HRG polypeptides include those containing predetermined 
mutations by, e.g., site-directed or PGR mutagenesis, and other animal species of HRG polypeptides such as 
raoDit, rat, porcine, non-human primate, equine, murine, and ovine HRG and alleles or other naturally 
occurring variants of the foregoing and human sequences; derivatives of HRG or its fragments as defined 
above wherein HRG or its fragments have been covalently modified by substitution, chemical, enzymatic, or 

25 other appropriate means with a moiety other than a naturally occurring amino acid (for example a detectable 
moiety such as an enzyme or radioisotope); glycosylation variants of HRG (insertion of a giycosyiation site 
or deletion of any glycosylation site by deletion, insertion or substitution of appropriate amino acid); and 
soluble forms of HRG, such as HRG-GFD or those that lack a frinctional transmembrane domain. 

"Isolated" means a ligand, such as HRG, which has been identified and separated and/or recovered 

30 from a component of its natural environment. Contaminant components of its natural environment are 
materials which would interfere with diagnostic or therapeutic uses for HRG, and may include proteins, 
hormones, and other substances. In preferred embodiments, HRG will be purified (1) to greater than 95% by 
weight of protein as determined by the Lowry method or other validated protein determination method, and 
most preferably more than 99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N- 

35 terminal or internal amino acid sequence by use of the best commercially available amino acid sequenator 
marketed on the filing date hereof, or (3) to homogeneity by SDS-PAGE using Coomassie blue or, 
preferably, silver stain. Isolated HRG includes HRG in situ w ithin recombinant cells since at least ne 
component of HRG natural environment will not be present Isolated HRG includes HRG from one species 
in a recombinant cell culture of another species since HRG in such circumstances will be devoid of source 

40 polypeptides. Ordinarily, however, isolated HRG will be prepared by at least one purification step. 

In accordance with this invention, HRG nucleic acid is RNA or DNA containing greater than ten 
bases that encodes a biologically or antigenically active HRG, is complementary to nucleic acid sequence 
encoding such HRG, or hybridizes to nucleic acid sequence encoding such HRG and remains stably bound to 
it under stringent conditions. 

45 Preferably, HRG nucleic acid encodes a polypeptide sharing at least 75% sequence identity, more 

preferably at least 80%, still more preferably at least 85%, even more preferably at 90%, and most preferably 
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"Restriction enzyme digestion" of DNA refers to catalytic cleavage of the DNA with an enzyme that 
acts only at certain locations in the DNA. Such enzymes are called restriction endonucieases, and the sites 
for which each is specific is called a restriction site. The various restriction enzymes used herein are 
commercially available and their reaction conditions, cofactors, and other requirements as established by the 
5 enzyme suppliers are used. Restriction enzymes commonly are designated by abbreviations composed of a 
capital lener followed by other letters representing the microorganism from which each restriction enzyme 
originally was obtained, and then a number designating the particular enzyme. In general, about 1 mg of 
plasmid or DNA fragment is used with about 1-2 imits of enzyme in about 20 ml of buffer solution. 
Appropriate buffers and substrate amounts for particular restriction enzymes are specified by the 

10 , manufacturer, incubation of about 1 hour at Zl^'C is ordinarily used, but may vary in accordance with the 
supplier's instructions. After incubation, protein or polypeptide is removed by extraction with phenol and 
chloroform, and the digested nucleic acid is recovered from the aqueous fraction by precipitation with 
ethanol. Digestion with a restriction enzyme may be followed with bacterial alkaline phosphatase hydrolysis 
of the terminal 5* phosphates to prevent the two restriction cleaved ends of a DNA fragment from 

15 "circularizing" or forming a closed loop that would impede insertion of another DNA fragment at the 
restriction site. Unless otherwise stated, digestion of plasmids is not followed by 5* terminal 
dephosphorylation. Procedures and reagents for dephosphorylation are conventional as described in sections 
1.56-1.61 of Sambrook et aL, {Molecular Cloning: A Laboratory Manual New York: Cold Spring Harbor 
Laboratory Press, 1989). 

20 "Recovery" or "isolation" of a given fragment of DNA from a restriction digest means separation of 

the digest on polyacrylamide or agarose gel by electrophoresis, identification of the fragment of interest by 
comparison of its mobility versus that of marker DNA fragments of known molecular weight, removal of the 
gel section containing the desired fragment, and separation of the gel from DNA. This procedure is known 
generally. For example, see Lawn era/.. Nucleic Acids Res. 9:6103-61 14 (1981), and Goeddel et ai. Nucleic 

25 Acids Res 8:4057 1 980). 

"Northern analysis" is a method used to identify RNA sequences that hybridize to a known probe 
such as an oligonucleotide, DNA fragment, cDNA or fragment thereof, or RNA fragment. The probe is 

labeled with a radioisoto|>e such as ^^P, or by biotinylation, or with an enzyme. The RNA to be analyzed is 
usually electrophoretically separated on an agarose or polyacrylamide gel, transferred to nitrocellulose, 

30 nylon, or other suitable membrane^ and hybridized with the probe, using standard techniques well known in 
the art such as those described in sections 7.39-7.52 of Sambrook et aLy supra, 

"Ligation" refers to the process of forming phosphodiester bonds between two nucleic acid 
fragments. To ligate the DNA fragments together, the ends of the DNA firagments must be compatible with 
each other. In some cases, the ends will be directly compatible after endonuclease digestion. However, it 

35 may be necessary to first convert tiie staggered ends commonly produced after endonuclease digestion to 
blunt ends to make them compatible for ligation. To blunt the ends, the DNA is treated in a suitable buffer 
for at least 15 minutes at \5^C with about 10 units of the Klenow fragment of DNA polymerase I or T4 DNA 
polymerase in the presence of the four deoxyribonucleotide triphosphates. The DNA is then purified by 
phenol-chloroform extraction and ethanol precipitation. The DNA fragments that are to be ligated together 

40 are put in solution in about equimolar amounts. The solution will also contain ATP, ligase buffer, and a 
ligase such as T4 DNA ligase at about iO imits per 0,5 mg of DNA. If the DNA is to be ligated into a vector, 
the vector is first linearized by digestion with the appropriate restriction endonuclease(s). The linearized 
fragment is then treated with bacterial alkaline phosphatase, or calf intestinal phosphatase to prevent self- 
ligation during the ligation step. 

45 "Preparation" of DNA from cells means isolating the plasmid DNA from a culture of the host cells. 

Commonly used methods for DNA preparation are the large and small-scale plasmid preparations described 
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"Protein microsequencing" was accomplished based upon the following procedures. Proteins from 
the final HPLC step were either sequenced directly by automated Edman degradation with a model 470A 
Applied Biosystems gas phase sequencer equipped with a I20A PTH amino acid analyzer or sequenced after 
digestion with various chemicals or enzymes, PTH amino acids were integrated using the 
5 CHROMPERFECT data system (Justice Innovations, Palo Alto, CA). Sequence interpretation was 
perfonned on a VAX 11/785 Digital Equipment Corporation computer as described (Henzel et ai.^ J. 
Chromatography 404:41-52 (1987)). In some cases, aiiquots of the HPLC fractions were electrophoresed 
on 5-20% SDS polyacrylamide gels, electrotransferred to a PVDF membrane (PROBLOTT, ABI, Foster 
City, CA) and stained with Coomassie Brilliant Blue (Matsudaira, P., J. Biol, Chem. 262:10035-10038, 

10 1987). The specific protein was excised from the blot for N terminal sequencing. To determine internal 
protein sequences, HPLC fractions were dried under vacuum (SPEEDVAC), resuspended in appropriate 
buffers, and digested with cyanogen bromide, the iysine-specific enzyme Lys-C (Wako Chemicals, 
Richmond, VA) or Asp-N (Boehringer Mannheim, Indianapolis, Ind.). After digestion, the resultant peptides 
were sequenced as a mixture or were resolved by HPLC on a C4 column developed with a propanol gradient 

15 in 0. 1% TFA before sequencing as described above. 

"Antibodies" (Abs) and "immunoglobulins'* (Igs) are glycoproteins having the same structural 
characteristics. While antibodies exhibit binding specificity to a specific antigen, immunoglobulins include 
both antibodies and other antibody-like molecules which lack antigen specificity. Polypeptides of the latter 
kind are, for example, produced at low levels by the lymph system and at increased levels by myelomas. 

20 Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" 

ft^gments, each with a single antigen-binding site, and a residual "Fc" fragment, whose name reflects its 
ability to crystallize readily. Pepsin treatment yields an F(ab*)2 fragment that has two antigen-combining 
sites and is still capable of cross-linking antigen. 

"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and -binding 

25 site. This region consists of a dimer of one heavy chain and one light chain variable domain in tight, non- 
covalent association. It is in this configuration that the three hypervariable regions of each variable domain 
interact to define an antigen-binding site on the surface of the Vj^-Vl dimer. Collectively, the six 

hypervariable regions confer antigen-binding specificity to the antibody. However, even a single variable 
domain (or half of an Fv comprising only three hypervariable regions specific for an antigen) has the ability 

30 to recognize and bind antigen, although at a lower affinity than the entire binding site. 

The Fab fragment also contains the constant domain of the light chain and the first constant domain 
(CHI) of the heavy chain. Fab* fi^gments differ from Fab fragments by the addition of a few residues at the 
carboxyl terminus of the heavy chain CHI domain including one or more cysteine(s) from the antibody 
hinge region. Fab'-SH is the designation herein for Fab* in which the cysteine residue(s) of the constant 

35 domains bear a free thiol group. F(ab')2 antibody fragments originally were produced as pairs of Fab* 

fragments which have hinge cysteines between them. Other chemical couplings of antibody fragments are 
also known. 

The **light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to one 
of two clearly, distinct types, called kappa (k) and lambda {X\ based on the amino acid sequences of their 
40 constant domains. 

E>epending on the amino acid sequence of the c nstant d main of their heavy chains, immunoglobulins 
can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG, 
and IgM, and several of these may be further divided into subclasses (isotypes), e.g., IgGl, IgG2, IgG3, 
IgG4, IgA, and IgA2. The heavy-chain constant domains that correspond to the different classes of 
45 immunoglobulins are called a, 6, e, ^d ^, respectively. The subunit structures and three-dimensional 
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antigen binding. For a review of sFv see Pluckthun in The Pharmacology of Monoclonal Antibodies^ vol. 
1 13, Rosenburg and Moore eds. Springer- Verlag, New York, pp. 269-3 15 (1994). 

The term "diabodies" refers to small antibody fragments with two antigen-binding sites» which 
fragments comprise a heavy chain variable domain (Vj^) connected to a light chain variable domain (V^) in 

5 the same polypeptide chain (V^ - Vl). By using a linker that is too short to allow pairing between the two 

domains on the same chain, the domains are forced to pair with the complementary domains of another chain 
and create two antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 
93/1 1 161; and HolHnger et aL Proc, NatL Acad. Sci. USA 90:6444-6448 (1993). 

The expression "linear antibodies" when used throughout this application refers to the antibodies 
10 described in Zapata et ai Protein Eng. 8(10): 1057- 1062 (1995). Briefly, these antibodies comprise a pair of 
tandem Fd segments (Vh-C^I-Vh-ChO which form a pair of antigen binding regions. Linear antibodies 
can be bispecific or monospecific. 

II. USE AND PREPARATION OF HRG SEQUENCES 

H. PREPARATION OF HRG SEQUENCES. INCLUDING VARIANTS 
15 The system to be employed in preparing HRG sequence will depend upon the particular HRG 

sequence selected. If the sequence is sufficiently small HRG may be prepared by in vitro polypeptide 
synthetic methods. Most commonly, however, HRG will be prepared in recombinant cell culture using the 
host-vector systems described below. Suitable HRG includes any biologically active and antigenetically 
active HRG. 

20 In general, mammalian host cells will be employed, and such hosts may or may not contain post- 

translational systems for processing HRG preprosequences in the normal fashion. If the host cells contain 
such systems then it will be possible to recover natural subdomain fragments such as HRG-GFD from the 
cultures. If not, then the proper processing can be accomplished by transforming the hosts with the required 
enzyme(s) or by supplying them in an in vitro method. However, it is not necessary to transform cells with 

25 the complete prepro or structural genes for a selected HRG when it is desired to only produce fragments of 
HRG sequences such as an HRG-GFD. For example, a start codon is ligated to the 5* end of DMA encoding 
an HRG-GFD, this DNA is used to transform host cells and the product expressed directly as the Met N- 
tcrminal form (if desired, the extraneous Met may be removed in vitro or by endogenous N-terminai 
demethionylases). Alternatively, HRG-GFD is expressed as a fusion with a signal sequence recognized by 

30 the host cell, which will process and secrete the mature HRG-GFD as is further described below. Amino 
acid sequence variants of native HRG-GFD sequences are produced in the same way. 

HRG sequences located between the first N-terminal mature residue and the first N-terminal residue 
of HRG-GFD sequence, termed HRG-NTD, may function at least in part as an unconventional signal 
sequence or as a normally circulating carrier/precursor for HRG-GFD having unique biological activity. 

35 HRG-NTD is produced in the same fashion as the full length molecule but from expression of DNA in which 
a stop codon is located at the C-terminus of HRG-NTD. In addition, HRG variants are expressed from DNA 
encoding protein in which both the GFD and NTD domains are in their proper orientation but which contain 
an amino acid insenion, deletion or substitution at the GFD-NTD cleavage site (located within the sequence 
VKC) which inhibits or prevents proteolytic cleavage of the NTD-GFD joining site in vivo^ and wherein a 

40 stop codon is positioned at the 3' end of the GFD- encoding sequence. In an example of this group f 
variants (termed HRG-NTDXGFD), (1) the lysine residue found in the NTD-GFD joining sequence VKC is 
deleted or (preferably) substituted by another residue other than arginyl such as histidyl, alanyl, threonyl or 
seryl and (2) a stop codon is introduced in the sequence RCT or RCQ in place of cysteinyl, or threonyl (for 

HRG-a) or glutaminyl (for HRG-p). 
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screening genomic DNA libraries include, but are not limited to, oligonucleotides; cDNAs or fragments 
thereof that encode the same or a similar gene; and/or homologous genomic DNAs or fragments thereof. 
Screening the cDNA or genomic library with the selected probe may be conducted using standard procedures 
as described in chapters 10-12 of Sam brook et ai, supra, 
5 An alternative means to isolate the gene encoding HRG-a is to use polymerase chain reaction (PCR) 

methodology as described in section 14 of Sambrook et ai, supra. This method requires the use of 
oligonucleotide probes that will hybridize to HRG-a, Strategies for selection of oligonucleotides are 
described below. 

Another alternative method for obtaining the gene of interest is to chemically synthesize it using one 
10 of the methods described in Engeis et al (Agnew. Chem. Int. Ed Engl. ^ 28: 716-734,1989. These methods 
include triester, phosphite, phosphoramidite and H-Phosphonate methods, PCR and other autoprimer 
methods, and oligonucleotide syntheses on solid supports. These methods may be used if the entire nucleic 
acid sequence of the gene is known, or the sequence of the nucleic acid complementary to the coding strand 
is available, or alternatively, if the target amino acid sequence is known, one may infer potential nucleic acid 
15 sequences using known and preferred coding residues for each amino acid residue. 

A preferred method of practicing this invention is to use carefully selected oligonucleotide 
sequences to screen cDNA libraries from various tissues, preferably human breast, colon, salivary gland, 
placental, fetal, brain, and carcinoma cell lines. Other biological sources of DNA encoding an heregulin-Iike 
ligand include other mammals and birds. Among the preferred mammals are members of the following 
20 orders: bovine, ovine, equine, murine, and rodentia. 

The oligonucleotide sequences selected as probes should be of sufficient length and sufficiently 
unambiguous that false positives are minimized. The actual nucleotide sequence(s) may, for example, be 

based on conserved or highly homologous nucleotide sequences or regions of HRG-a. The oligonucleotides 
may be degenerate at one or more positions. The use of degenerate oligonucleotides may be of particular 
25 importance where a library is screened from a species in which preferential codon usage in that species is not 
known. The oligonucleotide must be labeled such that it can be detected upon hybridization to DNA in the 

library being screened. The preferred method of labeling is to use ^^P-labeled ATP with polynucleotide 
kinase, as is well known in the art, to radiolabel the oligonucleotide. However, other methods may be used 
to label the oligonucleotide, including, but not limited to, biotinylation or enzyme labeling. 
30 Of particular interest is HRG-a nucleic acid that encodes a full-length polypeptide. In some 

preferred embodiments, the nucleic acid sequence includes the native HRG-a signal sequence. Nucleic acid 
having all the protein coding sequence is obtained by screening selected cDNA or genomic libraries, and, if 
: necessary, using conventional primer extension procedures as described in section 7.79 of Sambrook et aL, 
supra, to detect prectirsors and processing intermediates of mRNA that may not have been reverse- 
35 transcribed into cDNA. 

HRG-a encoding DNA of Figures lA-lD may be used to isolate DNA encoding the analogous 
ligand from other animal species via hybridization employing the methods discussed above. The preferred 
animals are mammals, particularly bovine, ovine, equine, feline, canine and rodentia, and more specifically 
rats, mice and rabbits. 

40 B. Amino Acid Sequence Variants of Heregulin 

Amino acid sequence variants of HRG are prepared by introducing appropriate nucleotide changes 
into HRG DNA, or by in vitro syndiesis of the desired HRG polypeptide. Such variants include, for 
example, deletions from, or insertions or substitutions of, residues within the amino acid sequence shown for 
human HRG sequences. Any combination of deletion, insertion, and substitution can be made to arrive at the 

45 final construct, provided that the final construct possesses the desired characteristics. Excluded from the 
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represent naturally occurring alleles (which will not require manipulation of HRG DNA) or predetermined 
mutant forms made by mutating the DNA, either to arrive at an allele or a variant not found in nature. In 
general, the location and nature of the mutation chosen will depend upon HRG characteristic to be modified. 
Obviously, such variations that, for example, convert HRG into a known receptor ligand, are not included 
5 within the scope of this invention, nor are any other HRG variants or polypeptide sequences that are not 
novel and unobvious over the prior art. 

Amino acid sequence deletions generally range from about I to 30 residues, more preferably about 
I to 10 residues, and typically about 1 to 5 are contiguous. Deletions may be introduced into regions of low 
homology with other EGF family precursors to modify the activity of HRG. Deletions from HRG in areas of 
10 substantial homology with other EGF family sequences will be more likely to modify the biological activity 
of HRG more significantly. The number of consecutive deletions will be selected so as to preserve the 
tertiary structure of HRG in the affected domain, e.g., cysteine crosslinking, beta-pleated sheet or alpha 
helix. 

Amino acid sequence insertions include amino- and/or carboxyl-terminal fusions ranging in length 
1 5 from one residue to polypeptides containing a hundred or more residues, as well as intrasequence insertions 
of single or multiple amino acid residues. Intrasequence insertions (i.e., insertions within HRG sequence) 
may range generally from about 1 to 10 residues, more preferably I to 5, and most preferably I to 3. 
Examples of temninal insertions include HRG with an N-terminal methionyl residue (an artifact of the direct 
expression of HRG in bacterial recombinant cell culture), and fusion of a heterologous N-terminal signal 
20 sequence to the N-terminus of HRG to facilitate the secretion of mature HRG from recombinant host cells. 
Such signal sequences generally will be obtained from, and thus be homologous to, the intended host cell 
. species. Suitable sequences include STII, tPA or Ipp for £. coli, alpha factor for yeasi, and viral signals such 
as herpes gD for mammalian cells. 

Other insertional variants of HRG include the fusion to the N- or C-terminus of HRG of an 
25 immunogenic polypeptide, e.g., bacterial polypeptides such as beta-lactamase or an enzyme encoded by the 
£. coU trp locus, or yeast protein, bovine serum albumin, and chemotactic polypeptides. C-terminal fusions 
of HRG-ECD with proteins having a long half-life such as immunoglobulin constant regions (or other 
inrmunoglobulin regions), albumin, or ferritin, as described in WO 89/02922, publishe.-* 6 April 1989 are 
contemplated. 

30 Another group of variants arc amino acid substitution variants. These variants have at least one 

amino acid residue in HRG molecule removed and a different residue inserted in its place. The sites f 
greatest interest for substitutional mutagenesis include sites identified as the active site(s) of HRG, and sites 
where the amino acids found in HRG ligands from various species are substantially different in terms of side- 
chain bulk, charge, and/or hydrophobicity. A likely sub-domain of HRG-GFD having biological activity as a 

35 growth factor is the C-terminal segment, in particular within the sequence about from glycine 218 to valine 
226 (HRG-a), and glycine 218 to lysine 228/serine 228 (HRG-P) based upon analogy to the EGF sub- 
sequence found to have EGF activity. 

Other sites of interest are those in which particular residues of HRG-like ligands obtained from 
various species are identical. These positions may be important for the biological activity of HRG. These 

40 sites, especially those falling within a sequence of at least three other identically conserved sites, are 
substituted in a relatively conservative manner. Such conservative substitutions are shown in Table 1 under 
the heading of "preferred substitutions". If such substitutions result in a change in biological activity, then 
more substantial changes, denominated exemplary substitutions in Table 1, or as further described below in 
reference to amino acid classes, are introduced and the products screened. 
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In another embodiment, any methionyl residue other than the staning methionyl residue f the 
signal sequence, or any residue located within about three residues N- or C-temiinal to each such methionyl 
residue, is substituted by another residue (preferably in accord with Table 1) or deleted. Alternatively, about 
1 -3 residues are inserted adjacent to such sites. 
5 Any cysteine residues not involved in maintaining the proper conformation of HRG also may be 

substituted, generally with serine, to improve the oxidative stability of the molecule and prevent aberrant 
crosslinking. 

Sites particularly suited for substitutions, deletions or insertions, or use as fragments, include, 
numbered from the N-terminus of HRG-a of Figure lA - ID: 
10 1) potential glycosaminoglycan addition sites at the serine-glycine dipeptides at 42-43, 64-65. 151- 
152; 

2) potential asparagine-Hnked glycosylation at positions 164, 170, 208 zmd 437, sites (NDS) 164-166, 
(NIT) 1 70- 1 72, (NTS) 208-2 1 0, and NTS (609-6 1 1 ); 

3) potential O-glycosylation in a cluster of serine and threonine at 209-21 8; 
1 5 4) cysteines at 226, 234, 240, 254, 256 and 265; 

5) transmembrane domain at 287-309; 

6) loop 1 delineated by cysteines 226 and 240; 

7) loop 2 delineated by cysteines 234 and 254; 

8) loop 3 delineated by cysteines 256 and 265; and 

20 9) potential protease processing sites at 2-3, 8-9, 23-24, 33-34, 36-37. 45-46, 48-49, 62-63, 66-67, 86- 
87, 1 10-1 11, 123-124, 134-135, 142-143, 272-273, 278-279 and 285-286; 

Analogous regions in HRG-Pl may be determined by reference to its* sequence. The analogous 

HRG-pi amino acids may be mutated or modified as discussed above for HRG-a. Analogous regions in 

HRG-p2 may also be determined by reference to its* sequence. The analogous HRG-P2 amino acids may be 

25 mutated or modified as discussed above for HRG-a or HRG-pi. Analogous regions in HRG-p3 may be 
determined by reference to its' sequence. Further, the analogous HRG-P3 amino acids may be mutated or 
modified as discussed above for HRG-a, HRG-pl, or HRG-p2. 

Another HRG variant is y-HRG (or gamma-heregulin). y-HRG is any polypeptide sequence that 
possesses at least one biological property of native sequence y-HRG having SEQ ID NO:l 1. The biological 

30 property of this variant is the same as for HRG noted above. This variant encompasses not only the 
polypeptide isolated from a native y-HRG source such as human MDA-MB-175 ceils or from another 
source, such as another animal species, but also the polypeptide prepared by recombinant or synthetic 
methods, it also includes variant forms including functional derivatives, allelic variants, naturally occurring 
isoforms and analogues thereof. Sometimes the y-HRG is "native y-HRG" which refers to endogenous y- 

35 HRG polypeptide which has been isolated from a mammal. The y-HRG can also be "native sequence y- 
HRG" insofar as it has the same amino acid sequence as a native y-HRG (e.g, himian y-HRG shown in Fig. 
7A-7C). Amino acid sequence variants of the native sequence are prepared by introducing appropriate 
nucleotide changes into the native sequence DNA, or by m vitro synthesis of the desired polypeptide. Such 
variants include, for example, deletions from, or insertions or substitutions of, residues within the amino acid 

40 sequence shown for the human protein in Fig. 7A-7C as generally described above for other HRG. Any 
combination of deletion, insertion, and substitution is made to arrive at the final construct, provided that the 
final construct possesses the desired chzuricteristics. The amino acid changes also may alter post- 
translational processes of the native sequence, such as changing the number or position of O-Iinked 
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AI83G, K185E, E186R, K187E, TI88G, F197Y, M198R, D201T; 

A183G, K185E, E186R. K187E, TI88G, P205Y, S206G, R207Y, Y208L, L209M; 

A183G, K185E, E186R, K187E, T188G, FI97Y, MI98R, D201T, P205Y, S206a R207Y, Y208L, L209M; 
AI83G, K185E, EI86R, K187E, TI88G, M226I; 
5 F197Y, M198R, D201T. P205Y, S206G, R207Y, Y208U L209M; 

F197Y, M198R, D20IT. P205Y, S206G, R207Y, Y208L, L209M, M226I; 
F197Y. M198R, D20IT, M226I; 

AJ83G, K185E, E186R, K187E, T188G, F197Y, M198R, D201T, M226I; 
. A183G. K185E, EI86R, K187E, T188G, P205Y, S206G, R207Y, Y208U L209M. M226I; 

. 10 A183G, K185E, E186R, K187E, T188G, F197Y, MI98R, D201T, P205Y, S206G, R207Y, Y208U L209M, 

M226I; 

F197Y, M198R, D201T, P205Y, S206G, R207Y, Y208U L209M, N223H, M226I; and 

A183G, K185E, E186R, K187E, T188G. F197Y, M198R, D201T, P205Y, S206G, R207Y, Y208L, L209M, 

N223H, M226I. 

13 In addition to including one or niore of the amino acid substitutions disclosed herein^ the heregulin 

variant can have one or more other modifications, such as an amino acid substitution, an insertion of at least 
one amino acid, a deletion of at least one amino acid, or a chemical modification. For example, the invention 
provides a heregulin variant that is a fragment. In a variation of this embodiment, the fragment includes 
residues corresponding to a portion of human heregulin-pl extending from about residue 175 to about 

20 residue 230 (i.e., the EGF-like domain). For example, the fragment can extend from residue 1 77 to residue 
244 and may be prepared by recombinant techniques (rHRCP 1 - 1 77-244). 

DNA encoding amino acid sequence variants of HRG is prepared by a variety of methods known in 
the art. These methods include, but are not limited to, isolation from a natural source (in the case of naturally 
occurring amino acid sequence variants) or preparation by oligonucleotide-mediated (or site-directed) 

25 mutagenesis, PCR mutagenesis, and cassette mutagenesis of an earlier prepared variant or a non-variant 
version of HRG. These techniques may utilize HRG nucleic acid (DNA or RNA), or* nucleic acid 
complementary to HRG nucleic acid. 

Oligonucleotide-mediated mutagenesis is a preferred method for preparing substitution, deletion, 
and insertion variants of HRG DNA. This technique is well known in the art as described by Adehnan et a!., 

30 DNA, 2: 183 (1983). Briefly, HRG DNA is altered by hybridizing an oligonucleotide encoding the desired 
mutation to a DNA template, where the template is the single-stranded form of a plasmid or bacteriophage 
containing the unaltered or native DNA sequence of HRG. After hybridization, a DNA polymerase is used 
to synthesize an entire second complementary strand of the template that will thus incorporate the 
oligonucleotide primer, and will code for the selected alteration in HRG DNA. 

35 Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal 

oligonucleotide will have 12 to 15 nucleotides that are completely complementary to the template on either 
side of the nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will hybridize 
properly to the single-stranded DNA template molecule. The oligonucleotides are readily synthesized using 
techniques known in the art such as that described by Crea ei ai. {Proc, Nail. Acad ScL USA, 75: 

40 5765,1978). 

Single-stranded DNA template may also be generated by denaturing double-stranded plasmid (or 
other) DNA using standard techniques. 

For alteration of the native DNA sequence (to generate amino acid sequence variants, for example), 
the ligonucleotide is hybridized to the single-stranded template under suitable hybridization conditions. A 
45 DNA polymerizing enzyme, usually the Klenow fragment of DNA polymerase I, is then added to synthesize 
the complementary strand of the template using the oligonucleotide as a primer for synthesis. A 
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primers is designed to overlap the position of the mutation and to contain the mutation; the sequence of the 
other primer must be identical to a stretch of sequence of the opposite strand of the piasmid, but this 
sequence can be located anywhere along the placid DNA. It is preferred, however, that the sequence of the 
second primer is located within 200 nucleotides from that of the first, such that in the end the entire amplified 
5 region of DNA bounded by the primers can be easily sequenced. PCR amplification using a primer pair like 
the one just described results in a population of DNA fragments that differ at the position of the mutation 
specified by the primer, and possibly at other positions, as template copying is somewhat error-prone. 

If the ratio of template to product material is extremely low, the vast majority of product DNA 
fragments incorporate the desired mutation(s). This product material is used to replace the corresponding 
10 region in the piasmid that served as PCR template using standard DNA technology. Mutations at separate 
positions can be introduced simultaneously by either using a mutant second primer, or performing a second 
PCR with different mutant primers and ligating the two resulting PCR fragments simultaneously to the vector 
fragment in a three (or moFe)-part ligation. 

In a specific example of PCR mutagenesis, template piasmid DNA (Img) is linearized by digestion 
15 with a restriction endonuclease that has a unique recognition site in the piasmid DNA outside of the region to 
be amplified. Of this material, 100 ng is added to a PCR mixture containing PCR buffer, which contains the 
four deoxynucleotide tri-phosphates and is included in the GENEAMP kits (obtained from Perkin-Elmer 
Cetus, Norwalk, CT and Emeryville, CA), and 25 pmole of each oligonucleotide primer, to a final volume of 
50 ml. The reaction mixture is overlaid with 35 ml mineral oil. The reaction is denatured for 5 minutes at 
20 100**C, placed briefly on ice, and then I ml Thermits aquaticus (Tag) DNA polymerase (5 units/ml, 
purchased from Perkin-EImer Cetus, Norwalk, CT and Emeryville, CA) is added below the mineral oil layer. 
The reaction mixture is then inserted into a DNA Thermal Cycler (purchased from Perkin-Ehner Cetus) 
programmed as follows: 

2min. 55X, 

25 30 sec. 72*C, then 19 cycles of the following: 

30 sec. 94°C, 
30 sec. 55''C, and 
30 sec. 72*'C. 

At the end of the program, the reaction vial is removed from the thermal cycler and the aqueous 

30 phase transferred to a new vial, extracted with phenol/chloroform (50:50:vol), and ethanol precipitated, and 
the DNA is recovered by standard procedures. This material is subsequently subjected to the appropriate 

; treatments for insertion into a vector. 

Another method for preparing variants, cassette mutagenesis, is based on the technique described by 
Wells et al. {Geney 34: 315,1985). The starting material is the piasmid (or other vector) comprising HRG 

35 DNA to be mutated. The codon(s) in HRG DNA to be mutated are identified. There must be a unique 
restriction endonuclease site on each side of the identified mutation site(s). If no such restriction sites exist, 
they may be generated ttsing the above-described oligonucleotide-mediated mutagenesis method to introduce 
them at appropriate locations in HRG DNA. After the restriction sites have been introduced into the piasmid, 
the piasmid is cut at these sites to linearize it. A double-stranded oligonucleotide encoding the sequence of 

40 the DNA between the restriction sites but containing the desired mutation(s) is synthesized using standard 
procedures. The two strands are synthesized separately and then hybridized together using standard 
techniques. This double-stranded oligonucleotide is referred to as the cassette. This cassette is designed to 
have 3' and 5* ends that are compatible with the ends of the linearized piasmid, such that it can be directly 
ligated to the piasmid. This piasmid now contains the mutated HRG DNA sequence. 

45 C Insertion of DNA into a Cloning Vehici 

The cDNA or genomic DNA encoding native or variant HRG is inserted into a repiicable vector for 
further cloning (amplification of the DNA) or f r expression. Many vectors are available, and selection of 
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encoding HRG is more complex than that of an exogenously replicated vector because restriction enzyme 
digestion is required to excise HRG DNA. DNA can be amplified by PGR and directly transfected into the 
host cells without any replication component 

(iii) Seiection Gene Compon nt 
5 Expression and cloning vectors should contain a selection gene, also termed a selectable marker. 

This gene encodes a protein necessary for the survival or growth of transformed host cells grown in a 
selective culture medium. Host cells not transformed with the vector containing the selection gene will not 
: survive in the culture medium. Typical selection genes encode proteins that (a) confer resistance to 
antibiotics or other toxins, e.g., ampicillin, neomycin, methotrexate, or tetracycline, (b) complement 
10 - auxotrophic deficiencies, or (c) supply critical nutrients not available from complex media, e.g., the gene 
encoding D-alanine racemase for BaciliL 

One example of a selection scheme utilizes a drug to arrest growth of a host cell. Those cells that 
, are successfully transformed with a heterologous gene express a protein conferring drug resistance and thus 
survive the selection regimen. Examples of such dominant selection use the drugs neomycin (Southern et 
15 a/., J, Moiec. AppL GeneL I: 327,1982), mycophenolic acid (Mulligan et aL, Science 209: 1422,1980) or 
hygromycin (Sugden et al,^ Moi CelL Biol. 5: 410-413,1985), The three examples given above employ 
bacterial genes under eukaryotic control to convey resistance to the appropriate drug G418 or neomycin 
(geneticin), xgpt (mycophenolic acid), or hygromycin, respectively. 

Another example of suitable selectable maricers for mammalian cells are those that enable the 
20 identification of cells competent to take up HRG nucleic acid, such as dihydrofoiate reductase (DHFR) or 
thymidine kinase. The mammalian cell transformants are placed under selection pressure which only the 
transformants are uniquely adapted to survive by virtue of having taken up the marker. Selection pressure is 
imposed by culturing the transformants under conditions in which the concentration of selection agent in the 
medium is successively changed, thereby leading to amplification of both the selection gene and the DNA 
25 that encodes HRG. Amplification is the process by which genes in greater demand for the production of a 
protein critical for growth are reiterated in tandem within the chromosomes of successive generations of 
recombinant cells. Increased quantities of HRG are synthesized from the amplified DNA. 

For example, cells transformed with the DHFR selection gene are first identified by culturing all of 
the transformants in a culture medium that contains methotrexate (Mtx), a competitive antagonist of DHFR. 
30 An appropriate host cell when wild-type DHFR is employed is the Chinese hamster ovary (CHO) cell line 
deficient in DHFR activity, prepared and propagated as described by Urlaub and Chasin, Proc. Natl Acad, 
Sci, USA, 77: 4216, 1980. The transformed cells are then exposed to increased levels of methotrexate. This 
V leads to the synthesis of multiple copies of the DHFR gene, and, concomitantly, multiple copies of other 
DNA comprising the expression vectors, such as the DNA encoding HRG. This amplification technique can 
35 be used with any otherwise suitable host, e.g., ATCC No. CCL61 CHO-KI, notwithstanding the presence of 
endogenous DHFR if, for example, a mutant DHFR gene that is highly resistant to Mtx is employed (HP 
1 17,060). Alternatively, host cells (particularly wild-type hosts that contain endogenous DHFR) transformed 
V. or co-transformed with DNA sequences encoding HRG, wild-type DHFR protein, and another selectable 
marker such as aminoglycoside 3* phosphotransferase (APH) can be selected by cell growth in mediimi 
40 - containing a selection agent for the selectable marker such as an aminogiycosidic antibiotic, e.g., kanamycin, 
neomycin, or G41 8 (see U.S. PaL No. 4,965,199). 

A suitable selection gene for use in yeast is the trpl gene present in the yeast plasmid YRp7 
(Stinchcomb et al. Nature, 282: 39, 1979; Kingsman et aL Gene, 7: 141, 1979; or Tschemper et aL, Gene^ 
10: 157, 1980). The trp\ gene provides a selection marker for a mutant strain of yeast lacking the ability to 
45 grow in tryptophan^ for example, ATCC No. 44076 or PEP4-1 (Jones, Genetics, 85: 12, 1977). The presence 
of the trp\ lesion in the yeast host cell genome then provides an effective environment for detecting 
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promoters, e.g., the actin promoter or an immunoglobulin promoter, from heat-shock promoters, and from 
the promoter normally associated with HRG sequence, provided such promoters are compatible with the host 
cell systems. 

The early and late promoters of the SV40 virus are conveniently obtained as an SV40 restriction 
5 fragment that also contains the SV40 viral origin of replication (Fiers et aL. Nature, 273:113 (1978); 

Mulligan and Berg, Science, 209: 1422-1427 (1980); Pavlakis et ai, Proc. NatL Acad. ScL USA. 78: 7398- 
.. . 7402 (1981)). The immediate early promoter of the human cytomegalovirus is conveniently obtained as a 
HincHW E restriction fragment (Greenaway et ai. Gene, 18: 355-360 (1982)). A system for expressing DNA 
in mammalian hosts using the bovine papilloma virus as a vector is disclosed in U.S. 4,419,446. A 
10 modification of this system is described in U.S. 4,601,978. See also Gray et aL, Nature, 295: 503-508 
(1982) on expressing cDNA encoding immune interferon in monkey cells; Reyes et aL, Nature, 297: 598- 
601 (1982) on expression of human b-interferon cDNA in mouse cells under the control of a thymidine 
kinase promoter from herpes simplex virus; Canaani and Berg, Proc. NatL Acad, ScL USA, 79: 5166-5170 
(1982) on expression of the human interferon bl gene in cultured mouse and rabbit cells; and Gorman et aL^ 
15 Proc, NatL Acad ScL USA, 79: 6777-6781 (1982) on expression of bacterial CAT sequences in CV-1 
monkey kidney cells, chicken embryo fibroblasts, Chinese hamster ovary cells, HeLa cells, and mouse NIH- 
3T3 cells using the Rous sarcoma virus long terminal repeat as a promoter. 

(v) Enhancer Element Component 

Transcription of a DNA encoding HRG of this invention by higher eukaryotes is often increased by 
20 inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DMA, usually about 
from 10-300 bp, that act on a promoter to increase its transcription. Enhancers are relatively orientation and 
position independent having been found 5' (Laimms et aL, Proc. NatL Acad ScL USA, 78: '^'93, 1981) and 3' 
(Lusky et aL, MoL Cell Bio., 3: 1 108, 1983) to the transcription unit, within an intron (Banerji et aL, Cell, 
33: 729, 1983) as well as within the coding sequence itself (Osborne et aL\ MoL Cell Bio., 4: 1293, 1984). 
25 Many enhancer sequences are now known from mammalian genes (glob in, elastase, albumin, a- fetoprotein 
and insulin). Typically, however, one will use an enhancer from a eukaryotic cell virus. Examples include 
the SV40 enhancer on the late side of the replication origin (bp 100-270), the cytomegalovirus early 
promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovinjis enhancers 
(see also Yaniv, Nature, 297: 17-18 (1982)) on enhancing elements for activation of eukaryotic promoters. 
30 The enhancer may be spliced into the vector at a position 5* or 3* to HRG DNA, but is preferably located at a 
Site 5* from the promoter. 

(vi) Transcription Termination Component 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or 
nucleated cells from other multicellular organisms) will also contain sequences necessary for the tertnination 
35 of transcription and for stabilizing the mRNA. Such sequences are commonly available fi-om the 5* and, 
occasionally 3* untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain 
nucleotide segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA 
encoding HRG, The 3* untranslated regions also include transcription termination sites. 

Construction of suitable vectors containing one or more of the above listed components the desired 
40 coding and control sequences employs standard ligation techniques. Isolated plasmids or DNA fragments 
are cleaved, tailored, and religated in the form desired to generate the plasmids required. 

For analysis to confirm correct sequences in plasmids constructed, the ligation mixmres are used to 
transform E. coli K12 strain 294 (ATCC 31,446) and successful transformants selected by ampicillin or 
tetracycline resistance where appropriate. Plasmids from the transformants are prepared, analyzed by 
45 restriction endonuclease digestion, and/or sequenced by the method of Messing et aL, Nucleic Acids Res. 9: 
309 (1981) or by the method of Maxam et aL^ Methods in Enzymology 65: 499 (1980). 
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Plant cell cultures of cotton, com, potato, soybean, petunia, tomato, and tobacco can be utilized as hosts. 
Typically, plant cells are transfected by incubation with certain strains of the bacterium Agrobacterium 
tumefaciens. which has been previously manipulated to contain HRG DNA. During incubation of the plant 
cell culture with A. nimefaciens^ the DNA encoding HRG is transferred to the plant cell host such that it is 
5 transfected, and will, under appropriate conditions, express HRG DNA. In addition, regulatory and signal 
sequences compatible with plant cells are available, such as the nopaline synthase promoter and 
, , polyadeny lation signal sequences (Depicker et aL, J, MoL AppL Gen., 1: 561 (1982)). In addition, DNA 
: segments isolated firom the upstream region of the T-DNA 780 gene are capable of activating or increasing 
.transcription levels of plant-expressible genes in recombinant DNA-containing plant tissue (see EP 321,196, 
1 0 published 2 1 June 1 989). 

However, interest has been greatest in vertebrate cells, and propagation of vertebrate cells in culture 
(tissue culture) has become a routine procedure in recent years {Tissue Culture, Academic Press, Kruse and 
Patterson, editors (1973)). Examples of useful mammalian host cell lines are monkey kidney CVl line 
transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 293 cells subcloned 
15 for growth in suspension culture, Graham et aiy J. Gen ViroL^ 36: 59, 1977); baby hamster kidney cells 
• (BHK, ATCC CCL 10); Chinese hamster ovary cells/-DHFR (CHO, Uriaub and Chasin, Proc. Natl, Acad, 
ScL USA, 77: 4216 (1980)); mouse Sertoli cells (TM4, Mather, BioL Reprod, 23: 243-251 (1980)); monkey 
kidney cells (CVl ATCC CCL 70); African green monkey kidney cells (VERC)-76, ATCC CRL- 1587); 
human cervical carcinoma cells (HELA, ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); 
20 buffalo rat liver cells (BRL 3A, ATCC CRL 1442); human lung ceils (W138. ATCC CCL 75); human liver 
cells (Hep G2, HB 8065); mouse mammary tumor (MMT 060562, ATCC CCL51); TRI cells (Mather et aL, 
Annals N.Y, Acad. ScL, 383: 44-68 (1982)); MRC 5 cells; FS4 cells; and a human hepatoma cell line (Hep 
G2). Preferred host cells are human embryonic kidney 293 and Chinese hamster ovary cells. 

Host cells are transfected and preferably transformed with the above-described expression or 
25 cloning vectors of this invention and cultured in conventional nutrient media modified as appropriate for 
inducing promoters, selecting transformants, or amplifying the genes encoding the desired sequences. 

Transfection refers to the taking up of an expression vector by a host cell whether or not any coding 
sequences are in feet expressed. Numerous methods of transfection arc known to the ordinarily skilled 
artisan, for example, CaP04 and electroporation. Successful transfection is generally recognized when any 

30 indication of the operation of this vector occurs within the host ceil. 

Transformation means introducing DNA into an organism so that the DNA is replicable, either as an 
. : extrachromosomal element or by chromosomal integration. Depending on the host cell used, transformation 
^is done using standard techniques appropriate to such cells. The calcium treatment employing calcium 
chloride, as described in section 1.82 of Sambrook et aL supra, is generally used for prokaryotes or other 
35 cells that contain substantial cell-wall barriers. Infection with Agrobacterium tutnefaciens is used for 
transformation of certain plant cells, as described by Shaw et al,y Gene^ 23: 315 (1983) and WO 89/05859, 
published 29 June 1989. For mammalian cells without such cell walls, the calcium phosphate precipitation 
method described in sections 16.30-1637 of Sambrook et al, supra, is preferred. General aspects of 
.-mammalian cell host system transformations have been described by Axel in U.S. 4,399,216, issued 16 
40 August 1983. Transformations into yeast are typically carried out according to the method of Van Solingen 
et al., J. Bact,, 130: 946 (1977) and Hsiao et aL, Proc. Natl Acad Scl (USA), 16: 3829 (1979). However, 
other methods for introducing DNA into cells such as by nuclear injection, electroporation, or protoplast 
fusion may also be used. 

E. Cuituring the Host Cells 
45 Prokaryotic cells used to produce HRG polypeptide of this invention are cultured in suitable media 

as described generally in Sambrook et al., supra. 
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prepared against a native HRG polypeptide or against a synthetic peptide based on the DNA sequences 
provided herein as described further below. 

G. Purification of The HereguHn Polypeptides 

HRG is recovered from a cellular membrane fraction. Alternatively, a proteolytically cleaved or a 
5 truncated expressed soluble HRG fragment or subdomain are recovered from the culture medium as a soluble 
polypeptide. A HRG is recovered from host cell lysates when directly expressed without a secretory signal. 

When HRG is expressed in a recombinant cell other than one of human origin, HRG is completely 
free of proteins or polypeptides of human origin. However, it is desirable to purify HRG from recombinant 
. . cell proteins or polypeptides to obtain preparations that are substantially homogeneous as to HRG. As a first 
10 : step, the culture mediiun or lysate is centnfuged to remove particulate cell debris. The membrane and 
soluble protein fractions are then separated. HRG is then be purified from both the soluble protein fraction 
(requiring the presence of a protease) and from the membrane fraction of the culture lysate, depending on 
whether HRG is membrane bound. The following procedures are exemplary of suitable purification 
procedures: fractionation on immunoaffinity or ion-exchange columns; ethanol precipitation; reverse phase 
15 HPLC; chromatography on silica, heparin SEPHAROSE or on a cation exchange resin such as DEAE; 
chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; and gel filtration using, for example, 
SEPHADEX G-75. 

HRG variants in which residues have been deleted, inserted or substituted are recovered in the same 
fashion as the native HRG, taking account of any substantial changes in properties occasioned by the 

20 variation. For example, preparation of a HRG fusion with another protein or polypeptide, e.g., a bacterial r 
viral antigen, facilitates purification; an immunoaffinity column containing antibody to the antigen can be 
used to adsorb the fusion. Immunoaffinity coiumns such as a rabbit polyclonal ami-HRG column can be 
employed to absorb HRG variant by binding it to at least one remaining immune epitope. A protease 
inhibitor such as phenylmethylsulfonylfluoride (PMSF) also may be useful to inhibit proteolytic degradation 

25 during purification, and antibiotics may be included to prevent the growth of adventitious contaminants. One 
skilled in the art will appreciate that purification methods suitable for native HRG may require modification 
to account for changes in the character of HRG variants or upon expression in recombinant cell culture. 

H. Covaient iVlodifications of HRG 

Covalent modifications of HRG polypeptides are included within the scope of this invention. Both 
30 native HRG and amino acid sequence variants of HRG optionally are covalently modified. One type of 
covalent modification included within the scope of this invention is a HRG polypeptide fragment. HRG 
fragments, such as HRG-GDF, having up to about 40 amino acid residues are conveniently prepared by 
A chemical synthesis, or by enzymatic or chemical cleavage of the full-length HRG polypeptide or HRG 
. variant polypeptide. Other types of covalent modifications of HRG or fragments thereof are introduced into 
35 the molecule by reacting targeted amino acid residues of HRG or fragments thereof with an organic 
derivatizing agent that is capable of reacting with selected side chains or the N- or C-terminal residues. 

Cysteiny] residues most commonly are reacted with a-ha!oacetates (and corresponding amines), 
such as chloroacetic acid or chloroacetamide, to give carboxymethyl or carboxyamidomethyl derivatives. 
Cysteinyl residues also are derivatized by reaction with bromotrifluoroacetone, a-bromo-P-(5- 
40 " imidozoyOpropionic acid, chloroacetyl phosphate, N-alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl 2- 
pyridyl disulfide, p-chloromercuribenzoate, 2-chloromercuri-4-nitrophenol, or chloro-7-nitrobenzo-2-oxa- 
l,3-dia2ole. 

Histidyl residues are derivatized by reaction with diethyipyrocarbonate at pH 5.5-7.0 because this 
agent is relatively specific for the histidyl side chain. Para-bromophenacyl bromide also is useful; the 
45 reaction is preferably performed in 0. 1 M sodiimi cacodylate at pH 6.0. 
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asparagine-X-serine and asparagine*X-threonine, where X is any amino acid except proline, are the 
recognition sequences for enzymatic attachment of the carbohydrate moiety to the asparagine side chain. 
Thus, the presence of either of these tri-peptide sequences in a polypeptide creates a potential glycosyiation 
site. O-linked glycosyiation refers to the attachment of one of the sugars N-acetylgalactosamine, galactose, 
or xylose, to a hydroxyamino acid, most commonly serine or threonine, although 5-hydroxyproline or 5- 
hydroxylysine may also be used. 

Glycosyiation sites are added to HRG by altering its amino acid sequence to contain one or more of 
the above-described tri-peptide sequences (for N- 1 inked glycosyiation sites). The alteration may also be 
made by the addition of, or substitution by, one or more serine or threonine residues to HRG (for O-linked 
glycosyiation sites). For ease, HRG is preferably altered through changes at the DNA level, particularly by 
mutating the DNA encoding HRG at preselected bases such that codons are generated that will translate into 
the desired amino acids. 

Chemical or enzymatic coupling of glycosides to HRG increases the number of carbohydrate 
substituents. These procedures are advantageous in that they do not require production of the polypeptide in 
a host cell that is capable of N- and O-linked glycosyiation. I>epending on the coupling mode used, the 
sugar(s) may be attached to (a) arginine and histidine, (b) free carboxyi groups, (c) free sulfhydryl groups 
such as those of cysteine, (d) free hydroxy! groups such as those of serine, threonine, or hydroxyproline, (e) 
aromatic residues such as those of phenylalanine, tyrosine, or tryptophan, or (f) the amide group of 
glutamine. These methods are described in WO 87/05330, published 1 1 September 1987, and in Aplin and 
Wriston (CRC Crit Rev. Biochem., pp. 259-306 (1981)). 

Carbohydrate moieties present on an HRG also are removed chemically or enzymatically. Chemical 
deglycosylation requires exposure of the polypeptide to the compound trifluoromethanesulfonic acid, or an 
equivalent compound. This treatment results in the cleavage of most or all sugars except the linking sugar 
(N-acetylglucosamine or N-acetylgalactosamine), while leaving the polypeptide intact. Chemical 
deglycosylation is described by Hakimuddin et al. {Arch, Biochem. Biophys,^ 259:52 (1987)) and by Edge et 
al, (Anal Biochem.^ 1 18:131 (1981)). Carbohydrate moieties are removed from HRG by a varjety of endo- 
and exo- glycosidases as described by Thotakura et al {Meth. EnzymoL, 138:350 (1987)). 

Glycosyiation also is suppressed by tunicamycin as described by Duskin et al {J. Biol Chem,, 
257:3105 (1982)). Tunicamycin blocks the formation of protein-N-glycoside linkages. 

HRG may also be modified by linking HRG to various nonproteinaceous polymers, e.g., 
polyethylene glycol, polypropylene glycol or polyoxyalkylenes, in the manner set forth in U.S. 4,640,835; 
4,496,689; 4,30 1 , 1 44; 4,670,4 1 7; 4,79 1 , 1 92 or 4, 1 79,337. 

One preferred way to increase the in vivo circulating half life of non-membrane bound HRG is to 
conjugate it to a polymer that confers extended half-life, such as polyethylene glycol (PEG). (Maxfield, et 
al. Polymer 16,505-509 (1975); Bailey, F. E., et aU in Nonionic Surfactants (Schick, M. J., ed.) pp.794-821, 
1967); (Abuchowski, A. et al, J. Biol. Chem. 252, 3582-3586, 1977; Abuchowski, A. et al. Cancer 
Biochem, Biophys, 7, 175-186, 1984); (Katre, N.V. et aL, Proc, Natl Acad, ScL, 84, 1487-1491, 1987; 
Goodson, R, et al. Bio Technology, 8, 343-346, 1990). Conjugation to PEG also has been reported to have 
reduced inununogenicity and toxicity (Abuchowski, A. et al, J. Biol Chem., 252, 3578-3581, 1977). 

HRG may also be entr^ped in microcapsules prepared, for example, by coacervation techniques or 
by interfacial polymerization (for example, hydroxymethylcellulose or gelatin-microcapsules and poly- 
(methylmethacylate) microcapsules, respectively), in colloidal drug delivery systems (for example, 
liposomes, albumin microspheres, microemulsions, nano-particles and nanocapsules), or in macroemulsions. 
Such techniques are disclosed in Remington's Pharmaceutical Sciences, 16th edition, Osol, A., Ed., (1980). 

Those skilled in the art will be capable of screening variants in order to select the optimal variant for 
the purpose intended. For example, a change in the immunological character of HRG, such as a change in 
affinity for a given antigen or for the HER2 receptor, is measured by a competitive-type immunoassay using 
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CuJture medium in which hybridoma cells are growing is assayed for production of monoclonal 
antibodies directed against the antigen. Preferably, the binding specificity of monoclonal antibodies 
produced by hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). 

The binding affinity of the monoclonal antibody can, for example, be determined by the Scatchard 
analysis of Munson et aL, Anal. Biochem,, 107:220 (1980). 

After hybridoma ceils are identified that produce antibodies of the desired specificity, affinity, and/or 
activity, the clones may be subcloned by limiting dilution procedures and grown by standard methods 
(Coding, Monoclonal Antibodies: Principles and PracticCj pp.59-103 (Academic Press, 1986)). Suitable 
culture media for this purpose include, for example, D-MEM or RPMI-1 640 medium. In addition, the 
hybridoma cells may be grown in vivo as ascites tumors in an animal. 

The monoclonal antibodies secreted by the subclones are suitably separated fi-om the culture medium, 
ascites fluid, or senmi by conventional immunoglobulin purification procedures such as, for example, protein 
A-SEPHAROSE, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

DNA encoding the monoclonal antibodies is readily isolated and sequenced using conventional 
procedures (eg., by using oligonucleotide probes that are capable of binding specifically to genes encoding 
the heavy and light chains of the monoclonal antibodies). The hybridoma cells serve as a preferred source of 
such DNA. Once isolated, the DNA may be placed into expression vectors, which are then transfected into 
host cells such as E. coli cells, simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that 
do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the 
recombinant host cells. Recombinant production of antibodies will be described in more detail below. 

iiybridoma cell lines producing antibodies are identified by screening the culture supematants for 
antibody which binds to HER2/HER3 receptors. This is routinely accomplished by conventional 
immunoassays using soluble receptor preparations or by FACS using cell-bound receptor and labeled 
candidate antibody. Agonist antibodies are preferably antibodies which stimulate autophosphoryiation in the 
HRG tyrosine autophosphoryiation assay described above. 

The hybrid cell lines can be maintained in culture in vitro in cell culture media. The cell lines of 
this invention can be selected and/or maintained in a composition comprising the continuous cell !ine in 
hypoxanthine-aminoptenn thymidine (HAT) medium. In fact, once the hybridoma cell line is established, it 
can be maintained on a variety of nutritionally adequate media. Moreover, the hybrid cell lines can be stored 
and preserved in any number of conventional ways, including fineezing and storage under liquid nitrogen. 
Frozen cell lines can be revived and cultured indefinitely with, resumed synthesis and secretion of 
monoclonal antibody. The secreted antibody is recovered from tissue culture supernatant by conventional 
methods such as precipitation, ion exchange chromatography, affinity chromatography, or the like. The 
antibodies described herein are also recovered from hybridoma cell cultures by conventional methods for 
purification of IgG or IgM as the case may be that heretofore have been used to purify these 
immunoglobulins fix>m pooled plasma, e.g., ethanol or polyethylene glycol precipitation procedures. 

Human antibodies may be used and are preferable. Such antibodies can be obtained by using 
human hybridomas (Cote et aL, Monoclonal Antibodies and Cancer Therapy, R. Liss, p. 77 (1985)). 
Chimeric antibodies, Cabilly et aL, U.S. 4,816,567, (Morrison et aL, Proc, NatL Acad Sci„ 81:6851 (1984); 
Neuberger ei ai,^ Nature 312:604 (1984); Takeda et al.. Nature 314:452 (1985)) containing a murine anti- 
HER2/HER3 variable region and a human constant region of appropriate biological activity (such as ability 
to activate human complement and mediate ADCC) are within the scope of this invention, as are humanized 
antibodies produced by conventional CDR-grafting methods(Riechmann et al.. Nature 332:333-327(1988); 
EP 0328404 A 1 ; EP 02394000 A2). 

Techniques for creating recombinant DNA versions of the antigen-binding regions of antibody 
molecules (Fab or variable regions fragments) which bypass the generation of monoclonal antibodies are also 
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Th rapeutic formulations of HRG or ag nist antibody are prepared for storage by mixing the HRG 
protein having the desired degree of purity with optional physiologically acceptable carriers, excipients, or 
stabilizers (Remington's Pharmaceutical Sciences, supra), in the form of lyophilized cake or aqueous 
solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the dosages and 
5 concentrations employed, and include buffers such as phosphate, citrate, and other organic acids; 
antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptides; 
. proteins, such as serum albiunin, gelatin, or immunoglobulins; hydrophilic polymers such as 
.. polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine or lysine; 
: monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or dextrins; chelating 
10 ; agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt-forming counterions such as sodium; 
and/or nonionic surfactants such as TWEEN, PLURONICS or polyethylene glycol (PEG). 

HRG or agonist antibody to be used for in vivo administration must be sterile. This is readily 
accomplished by filtration through sterile nitration membranes, prior to or following lyophilization and 
reconstitution. The HRG or antibody ordinarily will be stored in lyophilized form or in solution. 
15 Therapeutic HRG or antibody compositions generally are placed into a container having a sterile 

access port, for example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic 
injection needle. 

The route of HRG or antibody administration is in accord with known methods, e.g., injection r 
infusion by intravenous, intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial, powder or 
20 liquid aerosol administration to the nose or lung or intralesional routes, or by sustained release systems as 
noted below. The HRG ligand may be administered continuously by infusion or by bolus injection. An 
agonist antibody is preferably administered in the same fashion, or by administration into the blood stream or 
lymph. 

The HRG, HRG variant or fragment and agonist antibodies may be spray dried or spray freeze dried 
25 using known techniques (Yeo et al, Biotech, and Bioeng., 41:341-346 (1993); Gombotz et al, 
PCT/US90/02421). 

Suitable examples of sustained-release preparations include semipermeable matrices of solid 
hydrophobic polymers containing the protein, which matrices are in the form of shaped articles, e.g. films, or 
microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (e.g,, poly(2- 

30 hydroxyethyl-methacrylate) as described by Langer ei al.^ J, BiomedL Mater. Res,, 15: 167-277 (1981) and 
Langer, Chem, Tech,, 12: 98-105 (1982) or poIy(vinyIalcohol)), polylactides (U.S. 3,773,919, EP 58,481), 
.copolymers of L-glutamic acid and gamma ethyl-L-glutamate (Sidman et aL, Biopolymers, 22: 547-556 
(1983)), non-degradable ethylene-vinyl acetate (Langer e/ a/., supra\ degradable lactic acid-glycoiic acid 
copolymers such as the LUPRON DEPOT (injectable microspheres composed of lactic acid-glycolic acid 

35 copolymer and . leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid (EP 133,988). While polymers 
such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of molecules for over 100 days, 
certain hydrogels release proteins for shorter time periods. When encapsulated proteins remain in the body 
I for a long time, they may denature or aggregate as a result of exposure to moisture at 37*'C, resulting in a loss 
of biological activity and possible changes in immunogenicity. Rational strategies can be devised for protein 

40 stabilization depending on the mechanism involved. For example, if the aggregation mechanism is 
discovered to be intermolecuiar S-S bond formation through thio-disulflde interchange, stabilization may be 
achieved by modifying sulfhydryl residues, lyophilizing from acidic solutions, controlling moisture content, 
using appropriate additives, and developing specific polymer matrbc compositions. 

Sustained-release HRG or antibody compositions also include liposomally entrapped HRG or 

45 antibody. Liposomes containing HRG or antibody are prepared by methods known per se: DE 3,218,121; 
Epstein et al., Proc, Natl Acad Set. USA. 82: 3688-3692 (1985); Hwang et ai, Proc. NatL Acad Sci, USA, 
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L HER3-(TR)-I^*<^™ VHi HER4, G heR4-(TR)-DKTH VH- conserved TR sequence is 

derived from the Ml I site. The final expression constructs were in a pRK-type plasmid backbone wherein 
eukaryoiic expression is driven by a CMV promoter (Gorman et al, DNA Prot. Eng. Tech. 2:3-10 (1990)). 

To obtain protein for in vitro experiments, adherent HEK-293 cells were transfected with the 
5 appropriate expression plasmids using standard calcium phosphate methods (Gorman et aL. supra and Huang 
et ai. Nucleic Acids Res. 18:937-947 (1990)). Serum-containing media was replaced with serum-free media 
15 hours post-transfection and the transfected cells incubated for 5-7 days. The resulting conditioned media 
was harvested and passed through Protein A columns (1 mL Pharmacia HiTrap™). Purified IgG fusions 
were eluted with 0.1 M citric acid (pH 4,2) into tubes containing I M Tris pH 9.0. The eiuted proteins were 
10 subsequently dialyzed against PBS and concentrated using Centri-prep-30 filters (Am icon). Glycerol was 
added to a final concentration of 25% and the material stored at -20 C. Concentrations of material were 
determined via a Fc-ELISA 

Ceil Culture: Human breast cancer cell lines MDA-MB-175, MDA-MB-23U SK-BR-3 and MCF7 
were obtained from the American Type Culture Collection and maintained in a 50:50 mixture of F12 Ham*s 
15 and Dulbecco's modified Eagle medium (DMEM), supplemented with 10% heat inactivated FBS, 2 mM 
"glutamine and 10% penicillin-streptomycin. 

Generation and Characterization of cDNA Library: Total RNA was purified from MDA-MB-175 
cells using the guanidinium isothiocyanate-cesium chloride procedure (Sambrook et aL, Molecular Cloning: 
A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, (1989)). Poly (A)* RNA was 
20 isolated using oligo (dT) Dynabeads (DYNAL) as recommended by the supplier. First and second strand 

syntheses were performed using a Gibco BRL cDNA syi -thesis kit. AgtlO cDNA recombinants were 
generated when a cDNA cloning system from Amersham was used. In vitro packaging was performed using 
Gigapack II packaging extract (Stratagene). Pstl-Xhol HRGP3 cDNA fragment (ni 144-618) was labeled by 
random priming and 1 x 10* plaques were screened Positive clones were confirmed and purified by 
25 secondary and tertiary screening. Phage DNA was isolated as a BamHI fragment and subcloned into the 
corresponding site of pBluescript SK" . Clone 5 was completely sequenced using the Sequenase version 2.0 
DNA sequencing kit (United States Biochemicals, Inc.). Both stnmds were sequenced. 

Bacterial Expression System: A cDNA fragment of clone 5 (nt 1690- 2722) was subcloned into the 
pET-32 TRX fusion vector (Novagen). This Bglll-Bglll fragment was inserted into the BamHI site of the 

30 pET32a plasmid. The trxY-HRG (amino acids 455-768) protein expression in Exoli was induced as 
* recommended by the supplier. 

Purification of Recombinant y-HRG: E.coli cells expressing trxy-HRG were collected and 
suspended at 9 ml/g in 50 mM Tris HCL pH 8. Lysozyme was added to a final concentration of 0.2 mg/ml 
and the solution was stirred on ice for I hr. Dnase I (10 lig/ml) aiid MgClj (4 mM) were added. The 
35 solution was then sonicated for 30 min and cell pellets collected afterwards. The pellet fraction was 
dissolved at 250 ml/g in 6 M Gdn HCL, 0. 1 M Tris HCL, pH 8.8. Solubilized proteins were sulfitolyzed by 
adding I/IO volume of 1 M Na2S03 and 1/10 volume of 0.2 M Na2S40^. The reaction was allowed to 

proceed for 1.5 hours at room temperature and protein was purified by gel filtration chromatography using a 
High Load Superdex^ 75 prep grade column (Pharmacia). Refolding was initiated by the addition of 1 mM 
40 cysteine, and 10 mM methionine was added as an antioxidant and incubated overnight at room temperature. 
Protein concentration was determined by quantitative amino acid analysis. 

Northern and Southern Hybridization: Total RNA was isolated by the method of Chomczynski et 
al. Anal Biochem. 162:156-159 (1987). Poly (A)"*^ was isolated using oligo d(T) cellulose columns (Qiagen) 
as recommended by the supplier. RNA was denatured and size fractioned in a 0.8% formaldehyde/ 1% 
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seeded in T175 flasks and crown until reaching 70-80% confluency ( - 2.5 x 10^ ceils/ flask). Subsequently, 

ceils were washed with PBS and grown in serum free F12/ DMEM medium for 3-4 days. Medium was then 
collected, filtered and concentrated using an ultraflitration ceil with YMiO Diaflo uitaflitration membranes 
(Amicon). y-HRG was visualized in conditioned medium of MDA-MB-175 cells by Western blot analysis 
5 under non reducing conditions. y-HRG was partially purified by HPLC using a C4 reverse phase column. 
CHO expressed full. length HRGpl (lane i) and semi pure y-HRG (lane 2) were electrophoresed, blot was 
probed with HER2/HER4 IgG heterodimers and Western blot was developed A - 64 kDa band could be 
seen in the lane containing partial purifled supernatant whereas CHO expressed fiiil length HRGpi migrated 
as a 45 kDa protein. 

10 Cell Proliferation Assay with Crystal Violet: Tumor cell lines were plated in 96 well plates at 

following densities: 2x10"* cells/well for MDA-MB-175 and 1 x lo"* cells/well for SK-BR-3. The media 
contained 1% FBS and cells were allowed to adhere for 2 hours. Monoclonal antibodies, immunoadhesions 
(10 ng/ml) or media alone were added and the cells were incubated for 2 hours at 37;*C. rHRGPl 177.244 was 
added at a final concentration of 1 nM, or 100 nM for neutralising the immunoadhesion, and the ceils were 

15 incubated for 4 days. Monolayers were washed with PBS and stained/fixed with 0.5% crystal violet. Plates 
were air dried, the dye was eluted with 0. 1 M sodium citrate (pH 4.2) in ethanol (50:50) and the absorbance 
was measured at 540 nm. 

Isolation and sequence analysis of y-HRG: To characterize the heregulin transcript in MDA-MB- 
175 cells» a kgt 10 cDNA library was constructed with mRNA derived from this ceil line. The library was 

20 screened with a cDNA probe corresponding to the EGF-like domain and part of the N-terminal sequence of 
HRGp3. Various clones were identified. One of the clones which appeared to contain the full length cDNA 
sequence was isolated and sequenced. Fig.7A-7D shows the nucleotide sequence and the predicted amino 
acid sequence of y-HRG. The single open reading frame of 2303 bp starts with an ATG codon at nt 334. 
This start codon lies in a nucleotide sequence context, which is known to be a potential translation initiation 

25 site (Kozak, Nucleic Acid Research 15:8125-8148 (1987)). Several termination codons were found upstream 
of the initiation codon. The stop codon TAG at nt 2637 is followed by the 3' noncoding sequence, which is 
identical to other HRG isoform sequences and includes a polyadenylation signal followed by an A-rich 
region. The open reading frame encodes a protein of 768 amino acid residues with a calculated molecular 
massof 84.2 kDa. 

30 (d) The selection of HRG-pi variants containing residues corresponding to the minimal EGF-like 

domain (HRG-pi 177-228) was conducted using monovalent phage display. For these variants, residue 
numbers also are expressed, in parentheses, in terms of the position of the residue in the minimal EGF-like 
domain (i.e., HRG-P 1 EGF I -52). 

Variants of HRG-P I EGF were prepared and selected for binding to HER-3-lg using monovalent 
35 phage display, according to the method of Bass et al.^ Proteins 8:309-314 (1990). As discussed in detail 
below, an HRG-P I EGF phagemid vector was prepared, in which HRG-P 1 EGF was fused to a C-terminal 
fragment of the M13 coat protein pill. Kunkel mutagenesis was performed to introduce stop codons into this 
vector at sites selected for randomization. This step ensures that the starting vector is incapable of 
expressing the wild-type polypeptide. Stretches of four to six residues per library were randomized in a 

40 linear fashion, except for the six cysteines, Phel89 (HRG-pi EGF Phel3) and the two most C-terminal 
residues. Phel89 was not altered because this residue is conserved as an aromatic residue in EGF and TGF-a 
and forms a stacking interaction with Tyr208 (HRG-P I EGF Tyr32) Jacobsen et aL. Biochemistry 35:3402- 
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Table 4: 
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Table 5: 


Library D Variants 










Position in HRG-pl 
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197 


198 


199 


200 


201 




Wild-type 


F 


M 


V 


K 


D 




I*, 2*, 8*, 12* 


Y 


K 




R 


I 


1 s 


3 




R 






T 




4, 5, 7, 9 


Y 


R 






T 




6 


Y 




i 




Y 




10 


Y 








T 

1 




11 


M 


R 




R 


T 


20 
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Table 6: 


Library E Variants 










Position in HRG-pl 
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Y 
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K 


Y 
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Y 
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♦Variant also contained Met226Ile. 














Table 7: Library G Variants 








Position in HRG-pi 






Variant No. 


211 


212 


213 


214 


215 


40 


Wild-type 
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R 
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R 
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45 
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N 
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ExanriDle 2 - HER2/HER3 exoression in embrvonic rat June 





50 Rat lungs were microdissected from rat embryos on embryologic day (E) 16, 18^ 20, and post-natal 

(P) days 7, 14, and adult. Isolated lung tissue was homogenized in a standard protease inhibitor buffer, and 
equal protein amounts subjected to SDS-PAGE (4-20%), blotted to nitrocellulose and identified with specific 
antibodies {HER2, HER3, HER4-Santa Cruz Biological, San Jose, CA, and HRG, 3G1 1, Genentech, Inc.) 
using chemiluminescent techniques. 

55 Analysis of the blot indicates that HER2 is expressed at high levels throughout development in 

utero, appears to peak on El 8 and declines after birth to lower adult levels. HER3 expression peaks at El 8, 



wo 99/39729 



PCT/US99/02390 



containing overhangs. The fragment was inserted into phagemid display vector pam-g3 by restriction digest- 
iigation at the same sites to generate construct pHRG2-g3 (177-244). pain-g3 was a derivative of phGHam- 
g3, which was designed for phage display of human growth hormone (hGH) and was described in Lowman 
et al.. Biochemistry 30:10832-10838 (1991). pam-g3 was produced by removing the hGH gene present in 
5 phGHam-g3 and replacing this gene with a stuffer fragment, which provides space for cleavage at the 
restriction sites used for cloning. The HRG-pi fragment was attached to residue 247 of pHL 

The HRG-Pl EGF-like domain expressed from the above-described construct is designated by 
removing the "p" and the "-g3" that appear in the name of the construct. Thus, the HRG-Pl EGF-like 
domain expressed from the pHRG2-g3 construct is designated ''HRG2." 
10 The domain was displayed monovaiently on phage as a plII frision protein, as described by Bass et 

al.. Proteins 8:309-3 14 (1990). 

Similarly, variants HRG-P I j 47.227. HRG-pi 147.244 » and HRG-pi 177.227 were prepared and 
expressed as described above. 

Example 6 - rHRGBl |77_2. |^ causes accelerated lung deveiopment 

15 Exogenous rHRGpi 177.244 caused accelerated lung development in vitro. To determine if the 

expressed HER2/HER3 receptors were ftinctional and the role of HRG stimulation during lung development 
in vitro, rHRGP 1 177.244 was added to the in vitro culture at 10 nM. Tissue was harvested at D5, snap 

frozen, and 5 micron sections were cut for analysis. 

The morphology, in comparison to the untreated control specimens was grossly different- There 
20 was marked proliferation of the epithelium. Air spaces which are typically lined with a single cell layer now 

had a 2-3 cell thickness. The changes were dose dependent with more epithelial cell response with higher 

concentrations. HER2 and HER3 were still identifiable in the epithelium only. 

Example 7 - Human lung differentiation 

Differentiation of human lung epithelial cells occurs after HRG treatment. Differentiation was 
25 measured by Surfactant Protein A (SPA) production. All sections were stained for SPA. Human lung 

expiant stained for SPA with stain localizing in epithelial cells of the prealveolar ducts. Human lung expiant 

exposed to 10 nM HRG showed an effect on SPA production. As a differentiation control, a lung expiant 

was exposed to ImM dibutyryl cAMP. A negative control was also run. 
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17. The method of Claim 16, wherein the activating ligand is a heregulin (HRG) polypeptide, HRG 
variant, HRG agonist antibody or fragment thereof capable of binding to the HER2, HERS and/or HER4 
receptor. 

1 8. A method of treating respiratory distress or emphysema, comprising administering to a patient in 
need thereof an effective amount of an isolated HER2, HERS and/or HER4 activating ligand. 

19. The method of Claim 18, v^herein the activating ligand is a heregulin (HRG) polypeptide, HRG 
variant, HRG agonist antibody or fragment thereof capable of binding to the HER2, HERS and/or HER4 
receptor. 

20. A method, comprising the steps of: 

(a) obtaining a normal epithelial cell sample from a mammal; 

(b) contacting the sample with a ligand which activates HER2, HERS, HER4 or a combination 
thereof to induce growth and/or proliferation of epithelial cells in the sample and to obtain an expanded 
sample; and 

(c) re-introducing the expanded sample into the mammal. 
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GG GCG CGA GCG CCT CAG CGC GGC CGC TCG CTC TCC CCC 38 
Ala Arg Ala Pro Gin Arg Gly Arg Ser Leu Ser Pro 
1 5 10 

TCG AGG GAC AAA CTT TTC CCA AAC CCG ATC CGA GCC CTT 77 
Ser Ar^ Asp Lys Leu Phe Pro Asn Pro He Arg Ala Leu 
IS 20 25 

GGA CCA AAC TCG CCT GCG CCG AGA GCC GTC CGC GTA GAG 116 
Gly Pro Asn Ser Pro Ala Pro Arg Ala Val Arg Val Glu 

30 35 

CGC TCC GTC TCC GGC GAG ATG TCC GAG CGC AAA GAA GGC 155 
Arg Ser Val Ser Gly Glu Met Ser Glu Arg Lys Glu Gly 
40 45 50 

AGA GGC AAA GGG AAG GGC AA6 AAG AAG GAG CGA GGC TCC 194 
Arg Gly Lys Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser 
55 60 

GGC AAG AAG CCG GAG TCC GCG GCG GGC AGC CAG AGC CCA 233 
Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro 
65 70 75 

GCC TTG CCT CCC CGA TTG AAA GAG ATG AAA AGC CAG GAA 272 
Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 
80 65 90 

TCG GCT GCA GGT TCC AAA CTA GTC CTT CGG TGT GAA ACC 311 
Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr 

95 100 

AGT TCT GAA TAC TCC TCT CTC AGA TTC AAG TGG TTC AAG 350 
Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys 
105 110 115 

AAT GGG AAT GAA TTG AAT CGA AAA AAC AAA CCA CAA AAT 389 
Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn 
120 125 

ATC AAG ATA CAA AAA AAG CCA GGG AAG TCA GAA CTT CGC 428 
He Lys He Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg 
130 135 140 

ATT AAC AAA GCA TCA CTG GCT GAT TCT GGA GAG TAT ATG 467 
He Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met 
145 150 155 

TGC AAA GTG ATC AGC AAA TTA GGA AAT GAC AGT GCC TCT 506 
Cys Lys Val He Ser Lys Leu Gly Asn Asp Ser Ala Ser 

160 165 
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GCC AAT ATC ACC ATC GTG GAA TCA AAC GAG ATC ATC ACT 545 
Ala Asn lie Thr lie Val Glu Ser Asn Glu lie He Thr 
170 175 180 

GGT ATG CCA GCC TCA ACT GAA GGA GCA TAT GTG TCT TCA 584 
Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 
185 190 

GAG TCT CCC ATT AGA ATA TCA GTA TCC ACA GAA GGA GCA 623 
Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala 
195 200 205 

AAT ACT TCT TCA TCT ACA TCT ACA TCC ACC ACT GGG ACA 662 
Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr 
210 215 220 

AGC CAT CTT GTA AAA TGT GCG GAG AAG GAG AAA ACT TTC 701 
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe 

225 230 

TGT GTG AAT GGA GGG GAG TGC TTC ATG GTG AAA GAC CTT 740 
Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu 
235 240 245 

TCA AAC CCC TCG AGA TAC TTG TGC AAG TGC CAA CCT GGA 779 
Ser Asn Pro Ser Arg Tyr Leu Cys Lys cys Gin Pro Gly 
250 255 

TTC ACT GGA GCA AGA TGT ACT GAG AAT GTG CCC ATG AAA 818 
Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro Met Lys 
260 265 270 

GTC CAA AAC CAA GAA AAG GCG GAG GAG CTG TAC CAG AAG 857 
Val Gin Asn Gin Glu Lys Ala Glu Glu Leu Tyr Gin Lys 
275 280 285 

AGA GTG CTG ACC ATA ACC GGC ATC TGC ATC GCC CTC CTT 896 
Arg Val Leu Thr He Thr Gly He cys He Ala Leu Leu 

290 295 

GTG GTC GGC ATC ATG TGT GTG GTG GCC TAC TGC AAA ACC 935 
Val Val Gly He Met Cys Val Val Ala Tyr cys Lys Thr 
300 305 310 

AAG AAA CAG CGG AAA AAG CTG CAT GAC CGT CTT CGG CAG 974 
Lys Lys Gin Arg Lys Lys Leu His Tisp Arg Leu Arg Gin 
315 320 

AGC CTT CGG TCT GAA CGA AAC AAT ATG ATG AAC ATT GCC 1013 

Ser Leu Arg Ser Glu Arg Asn Asn Met Met Asn Zle Ala 
325 330 335 
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AAT GGG CCT CAC CAT CCT AAC CCA CCC CCC GAG AAT GTC 1052 
Asn Gly Pro His His Pro Asn Pro Pro Pro Glu Asn Val 
340 345 350 

GAG CTG GTG AAT CAA TAG GTA TOT AAA AAC GTC ATC TCC 1091 
Gin liCu Val Asn Gin Tyr Val Ser Lys Asn Val lie ser 

355 360 

ACT GAG CAT ATT GTT GAG AGA GAA GCA GAG ACA TCC TTT 1130 
Ser Glu His lie Val Glu Arg Glu Ala Glu Thr Ser Phe 
365 370 375 

TCC ACC AGT CAC TAT ACT TCC ACA GCC CAT CAC TCC ACT 1169 
Ser Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr 
380 385 

ACT GTC ACC CAG ACT CCT AGC CAC AGC TGG AGC AAC GGA 1208 
Thr Val Thr Gin Thr Pro Ser His Ser Trp Ser Asn Gly 
390 395 ^00 

CAC ACT GAA AGC ATC CTT TCC GAA AGC CAC TCT GTA ATC 1247 
His Thr Glu Ser He Leu Ser Glu Ser His Ser Val He 
405 410 415 

GTG ATG TCA TCC GTA GAA AAC AGT AGG CAC AGC AGC CCA 1286 
Val Met Ser Ser Val Glu Asn Ser Arg His ser Ser Pro 

420 425 

ACT GGG GGC CCA AGA GGA CGT CTT AAT GGC ACA GGA GGC- 1325 
Thr Gly Gly Pro Arg Gly Arg i«eu Asn Gly Thr Gly Gly 
430 435 440 

CCT CGT GAA TGT fAC AGC TTC CTC AGG CAT GCC AGA GAA 1364 
Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala Arg Glu 
445 450 

ACC CCT GAT TCC TAC CGA GAC TCT CCT CAT AGT GAA AGG 1403 
Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg 
455 460 465 

TAT GTG TCA GCC ATG ACC ACC CCG GCT CGT ATG TCA CCT 1442 

Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro 
470 475 480 

GTA GAT TTC CAC ACG CCA AGC TCC CCC AAA TCG CCC CCT 1481 
Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro 

485 490 

TCG GAA ATG TCT CCA CCC GTG TCC AGC ATG ACG GTG TCC 1520 
Ser Glu Met S r Pro Pro Val Ser Ser M t Thr Val Ser 
495 500 505 
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ATG CCT TCC ATG GCG GTC AGC CCC TTC ATG GAA GAA GAG 1559 
Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu 
510 515 

AGA CCT CTA CTT CTC GTG ACA CCA CCA AGG CTG CGG GAG 1598 
Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu 
520 525 530 

AAG AAG TTT GAC CAT CAC CCT CAG CAG TTC AGC TCC TTC 1637 
Lys Lys Phe Asp His His Pro Gin Gin Phe Ser Ser Phe 
535 540 545 

CAC CAC MiC CCC GCG CAT GAC AGT AAC AGC CTC CCT GCT 1676 
His His Asn Pro Ala His Asp Ser Asn Ser Leu Pro Ala 

550 555 

AGC CCC TTG AGG ATA GTG GAG GAT GAG GAG TAT GAA ACG 1715 
Ser Pro Leu Arg lie Val Glu Asp Glu Glu Tyr Glu Thr 
560 565 570 

ACC CAA GAG TAC GAG CCA GCC CAA GAG CCT GTT AAG AAA 1754 
Thr Gin Glu Tyr Glu Pro Ala Gin Glu Pro val Lys Lys 
575 580 

CTC GCC AAT AGC CGG CGG GCC AAA AGA ACC AAG CCC AAT 1793 
Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn 
585 590 595 

GGC CAC ATT GCT AAC AGA TTG GAA GTG GAC AGC AAC ACA -1832 
Gly His lie Ala Asn Arg Leu Glu val Asp Ser Asn Thr 
600 605 610 

AGC TCC CAG AGC AGT AAC TCA GAG AGT GAA ACA GAA GAT 1871 
Ser Ser Gin Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp 

615 620 

GAA AGA GTA GGT GAA GAT ACG CCT TTC CTG GGC ATA CAG 1910 
Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Gly lie Gin 
625 630 635 

AAC CCC CTG GCA GCC AGT CTT GAG GCA ACA CCT GCC TTC 1949 
Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe 
640 645 

CGC CTG GCT GAC AGC AGG ACT AAC CCA GCA GGC CGC TTC 1988 
Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe 
650 655 660 

TCG ACA CAG GAA GAA ATC CAG G 2010 
Ser Thr Gin Glu Glu He Gin 
665 669 
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GG GAC AAA CTT TTC CCA AAC CCG ATC CGA GCC CTT GGA 38 
Asp Lys ZjCu phe Pro Asn Pro He Arg Ala Leu Gly 
15 10 



CCA AAC TCG CCT GCG CCG AGA GCC GTC CGC GTA GAG CGC 77 
Pro Asn Ser Pro Ala Pro Arg Ala Val Arg Val Glu Arg 
15 20 25 

TCC GTC TCC GGC GAG ATG TCC GAG CGC AAA GAA GGC AGA 116 
Ser Val Ser Gly Glu Met Ser Glu Arg Lys Glu Gly Arg 

30 35 

GGC AAA GGG AAG GGC AA6 AAG AAG GAG CGA GGC TCC GGC 155 
Gly Lys Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly 
40 45 50 

AAG AAG CCG GAG TCC GCG GCG GGC AGC CAG AGC CCA GCC 194 
Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala 
55 60 

TTG CCT CCC CAA TTG AAA GAG ATG AAA AGC CAG GAA TCG 233 
Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser 
65 70 75 

GCT GCA GGT TCC AAA CTA GTC CTT CGG TGT GAA ACC AGT 272 
Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 
80 85 90 

TCT GAA TAC TCC TCT CTC AGA TTC AAG TGG TTC AAG AAT 311 
Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn 

95 100 

GGG AAT GAA TTG AAT CGA AAA AAC AAA CCA CAA AAT ATC 350 
Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn He 
105 110 115 

AAG ATA CAA AAA AAG CCA GGG AAG TCA GAA CTT CGC ATT 389 
Lys He Gin Lys Lys Pro Gly Lys Ser Glu Leu /org He 
120 125 

AAC AAA GCA TCA CTG GCT GAT TCT GGA GAG TAT ATG TGC 428 
Asn Lys Ala ser Leu Ala Asp Ser Gly Glu Tyr Met Cys 
130 135 140 

AAA GTG ATC AGC AAA TTA GGA AAT GAC AGT GCC TCT GCC 467 
Lys Val He Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala 
145 150 155 

AAT ATC ACC ATC GTG GAA TCA AAC GAG ATC ATC ACT GGT 506 
Asn He Thr He Val Glu Ser Asn Glu H He Thr Gly 

160 165 
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ATG CCA GCC TCA ACT GAA GGA GCA TAT GTG TCT TCA GAG 545 
Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu 
170 175 180 

TCT CCC ATT AGA ATA TCA GTA TCC ACA GAA GGA GCA AAT 584 
Ser Pro lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn 
185 190 

ACT TCT TCA TCT ACA TCT ACA TCC ACC ACT GGG ACA AGC 623 
Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr ser 
195 200 205 

CAT CTT GTA AAA TGT GCG GAG AAG GAG AAA ACT TTC TGT 662 
His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys 
210 215 220 

GTG AAT GGA GGG GAG TGC TTC ATG GTG AAA GAC CTT TCA 701 
Val Asn Gly Gly Glu Cys Phe Met val Lys Asp Leu Ser 

225 230 

AAC CCC TCG AGA TAC TTG TGC AAG TGC CCA AAT GAG TTT 740 
Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe 
235 240 245 

ACT GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC 779 
Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe 
250 255 

TAC AAG CAT CTT GGG ATT GAA TTT ATG GAG GCG GAG GAG- 818 
Tyr Lys His Leu Gly He Glu Phe Met Glu Ala. Glu Glu 
260 265 270 

CTG TAC CAG AAG AGA GTG CTG ACC ATA ACC GGC ATC TGC 857 

Leu Tyr Gin Lys Arg Val Leu Thr He Thr Gly He Cys 
275 280 285 

ATC GCC CTC CTT GTG GTC GGC ATC ATG TGT GTG GTG GCC 896 
He Ala Leu Leu Val Val Gly He Met cys Val Val Ala 

290 295 

TAC TCC AAA ACC AAG AAA CAG CGG AAA AAG CTG CAT GAC 935 
TVr Cys LYS Thr Lys Lys Gin Arg Lys Lys Leu His Asp 
300 305 310 

CGT CTT CGG CAG AGC CTT CGG TCT GAA CGA AAC AAT ATC 974 
Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn Met 
315 320 

ATG AAC ATT GCC AAT GGG CCT CAC CAT CCT AAC CCA CCC 1013 

net Asn He Ala KsH Gly Pro His His Pro Asn Pro Pro 
325 330 335 
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CCC GAG AAT GTC CAG CTG GTG AAT CAA TAG GTA TCT AAA 1052 
pro Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys 
340 345 350 

AAC GTC ATC TCC ACT GAG CAT ATT GTT GAG AGA GAA GCA 1091 
Asn Val lie Ser Ser Glu His He Val Glu Arg Glu Ala 

355 360 

GAG ACA TCC TTT TCC ACC AGT CAC TAT ACT TCC ACA GCC 1130 
Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala 
365 370 375 

CAT CAC TCC ACT ACT GTC ACC CAG ACT CCT AGC CAC AGC 1169 
His His ser Thr Thr Val Thr Gin Thr Pro Ser His Ser 
380 385 

TGG AGC AAC GGA CAC ACT GAA AGC ATC CTT TCC GAA AGC 1208 
Trp ser Asn Gly His Thr Glu Ser He Leu Ser Glu Ser 
390 395 400 

CAC TCT GTA ATC GTG ATG TCA TCC GTA GAA AAC AGT AGG 1247 
His ser val He Val Met Ser Ser Val Glu Asn Ser Arg 
405 410 415 

CAC AGC AGC CCA ACT GGG GGC CCA AGA GGA CGT CTT AAT 1286 
His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn 

420 425 

GGC ACA GGA GGC CCT CGT GAA TGT AAC AGC TTC CTC AGG" 1325 
Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg 
430 435 440 

CAT GCC AGA GAA ACC CCT GAT TCC TAC CGA GAC TCT CCT 1364 
His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro 
445 450 

CAT AGT GAA AGG TAT GTG TCA GCC ATG ACC ACC CCG GCT 1403 
His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala 
455 460 465 

CGT ATG TCA CCT GTA GAT TTC CAC ACG CCA AGC TCC CCC 1442 
Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro 
470 475 480 

AAA TCG CCC CCT TCG GAA ATG TCT CCA CCC GTG TCC AGC 1481 
Lys Ser Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser 

485 490 

ATG ACG GTG TCC ATG CCT TCC ATG GCG GTC AGC CCC TTC 1520 
Met Thr Val Ser Met Pro Ser Met Ala Val S r Pro Phe 
495 SCO 505 
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ATO GAA OAA GAG AGA CCT CTA CTT CTC GTO ACA CCA CCA 1559 
Met Glu aiu Glu Arg Pro Leu Leu Leu Val Thr Pro Pro 
510 515 

AGG CTG CGG GAG AAG AAG TTT GAC CAT CAC CCT CAG CAG 1596 
Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gin Gin 
520 525 530 

TTC AGC TCC TTC CAC CAC AAC CCC GCG CAT GAC AGT AAC 1637 
Phe Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn 
535 540 545 

AGC CTC CCT GCT AGC CCC TTG AGG ATA GTG GAG GAT GAG 1676 
Ser Leu Pro Ala Ser Pro Leu Arg lie Val Glu Asp Glu 

550 555 

GAG TAT QAA ACG ACC CAA GAG TAC GAG CCA GCC CAA GAG 1715 
Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin Glu 
560 565 570 

CCT GTT AAG AAA CTC GCC AAT AGC CGG CGG GCC AAA AGA 1754 
Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg 
575 580 

ACC AAG CCC AAT GGC CAC ATT GCT AAC AGA TTG GAA GTG 1793 
Thr Lys Pro Asn Gly His lie Ala Asn Arg Leu Glu Val 
585 590 595 

GAC AGC AAC ACA AGC TCC CAG AGC AGT AAC TCA GAG AGT' 1832 
Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn Ser Glu Ser 
600 605 610 
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GAA ACA GAA GAT GAA AGA GTA GGT GAA GAT ACG CCT TTC 1871 
Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe 

615 620 

CT6 GGC ATA CAG AAC CCC CTG GCA GCC AGT CTT GAG GCA 1910 
Leu Gly lie Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala 
625 630 635 

ACA CCT GCC TTC CGC CTG GCT GAC AGO AGG ACT AAC CCA 1949 
Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro 
640 645 

GCA GGC CGC TTC TCG ACA CAG GAA GAA ATC CAG GCC AGG 1988 
Ala Gly Arg Phe Ser Thr Gin Glu Glu lie Gin Ala Arg 
650 655 660 

CTG TCT AGT GTA ATT GCT AAC CAA GAC CCT ATT GCT GTA TA 2029 
Leu Ser Ser Val He Ala Asn Gin Asp Pro He Ala val 
665 670 675 

A AACCTAAATA AACACATAGA TTCACCTGTA AAACTTTATT 2070 

TTATATAATA AAGTATTCCA CCTTAAATTA AACAATTTAT TTTATTTTAG 2120 

CAGTTCTGCA AATAGAAAAC AGGAAAAAAA CTTTTATAAA TTAAATATAT 2170 

GTAT6TAAAA ATGAAAAAAA AAAAAAAAA 2199 
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GTGGCTGCGG GGCAATTGAA AAAGAGCCGG CGAGGACTTC CCXX5AAACTT SO 
GTTGGAACTC CGGGCTCGCG CGGAGGCCAG GAGCTGAGCG GCGGCGGCTG 100 
CCGGACGATG GGAGCGTGAG CAGGACGGTG ATAACCTCTC CCCGATCGGG 150 
TTGCGAGGGC GCCGGGCAGA GGCCAGGACG CGAGCCGCCA GCGGCGGGAC 200 
CCATCGAC6A CTTCCXXK3GG CGACAGGAGC AGCCCCGAGA GCCAGGGCGA 250 
GCGCCCGTTC CAGGTGGCCG GACCGCCCGC CGCGTCCGCG CCGCGCTCCC 300 



TGCAGGCAAC GGGAGACGCC CCCGCGCAGC GCGAGCGCCT CAGCGCGGCC 350 



GCTCGCTCTC CCCATCGAGG GACAAACTTT TCCCAAACCC GATCCGAGCC 400 



CTTGGACCAA ACTCGCCTGC GCCGAGAGCC GTCCGCGTAG AGCGCTCCGT 450 



CTCCGGCGAG ATG TCC GAG CGC AAA GAA GGC AGA GGC AAA 490 
Met Ser Glu Arg Lys Glu Gly Arg Gly Lys 
1 5 10 ' 

GGG AAG GGC AAG AAG AAG GAG CGA GGC TCC GGC AAG AAG 529 
Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly Lys Lys 

15 20 

CCG GAG ICC GCG GCG GGC AGC CAG AGC CCA GCC TTG CCT 568 
Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala Leu Pro 
25 30 35 

CCC CAA TTG AAA GAG ATG AAA AGC CAG GAA TCG GCT GCA 607 
Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser Ala Ala 
40 45 

GGT TCC AAA CTA GTC CTT CGG TGT GAA ACC AGT TCT GAA 646 
Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu 
50 55 60 

TAC TCC TCT CTC AGA TTC AAG TGG TTC AAG AAT GGG AAT 685 
Tyr Ser Ser Leu Arg Ph Lys Trp Phe Lys Asn Gly Asn 
65 70 75 

GAA TTG AAT CGA AAA AAC AAA CCA CAA AAT ATC AAG ATA 724 
Glu Z«eu Asn Arg Lys Asn Lys Pro Gin Asn Zle Lys Zle 

80 85 
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CAA AAA AAG CCA GGG AAG TCA GAA CTT CGC ATT AAC AAA 763 
Gin l#y I*ys Pro Gly I*ys S r Glu I*eu Arg II Asn Lya 
90 95 100 

GCA TCA CTG GCT GAT TOT GGA GAG TAT ATG TGC AAA GTG 802 
Ala Ser I-eu Ala Ksp Ser Gly Glu Tyr Met Cya Lys Val 
105 110 

ATC AGC AAA TTA GGA AAT GAG AGT GCC TCT GCC AAT ATC 841 
lie Ser Lya Leu Gly Asn Aap Ser Ala Ser Ala Asn He 
115 120 125 

ACC ATC GTG GAA TCA AAC GAG ATC ATC ACT GGT ATG CCA 880 
Thr He Val Glu Ser Asn Glu He He Thr Gly Met Pro 
130 135 140 

GCC TCA ACT GAA GGA GCA TAT GTG TCT TCA GAG TCT CCC 919 
Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu Ser Pro 

145 ISO 

ATT AG A ATA TCA GTA TCC ACA GAA GGA GCA AAT ACT TCT 958 
He Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr Ser 
155 160 165 

TCA TCT ACA TCT ACA TCC ACC ACT GGG ACA AGC CAT CTT 997 
Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu 
170 175 

GTA AAA TGT GCG GAG AAG GAG AAA ACT TTC TGT GTG AAT 1036 
Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn , 
180 185 190 

GGA GGG GAG TGC TTC ATG GTG AAA GAC CTT TCA AAC CCC 1075 
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro 
195 200 205 

TCG AGA TAC TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT Ill4 
Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly 

210 215 

GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC AAG 1153 
Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr Lys 
220 225 230 

GCG GAG GAG CTG TAC CAG AAG AGA GTG CTG ACC ATA ACC 1192 
Ala Glu Glu Leu Tyr Gin Lys Arg Val Leu Thr He Thr 
235 240 

GGC ATC TGC ATC GCC CTC CTT GTG GTC GGC ATC ATG TGT 1231 
Gly He Cys He Ala Leu Leu Val Val Gly He Met Cya 
245 250 255 

GTG GTG GCC TAC TGC AAA ACC AAG AAA CAG CGG AAA AAG 1270 
Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys 
260 265 270 
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CTG CAT GAC CGT CTT CGG CAG AGC CTT CGG TCT GAA CGA 1309 
l^u His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg 

275 280 

AAC AAT ATG ATG AAC ATT GCC AAT GGG CCT CAC CAT CCT 1348 
Asn Asn Met Met Asn lie Ala Asn Gly Pro His His Pro 
285 290 295 

AAC CCA CCC CCC GAG AAT GTC GAG CTG GTG AAT CAA TAG 1387 
Asn Pro Pro Pro Glu Asn Val Gin Leu Val Asn Gin Tyr 
300 305 

GTA TCT AAA AAC GTC ATC TCC AGT GAG CAT ATT GTT GAG 1426 
Val Ser Lys Asn Val lie Ser Ser Glu His lie Val Glu 
310 315 320 

AGA GAA GCA GAG ACA TCC TTT TCC ACC AGT CAC TAT ACT 1465 
Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr 
325 320 335 

TCC ACA GCC CAT CAC TCC ACT ACT GTC ACC CAG ACT CCT 1504 
Ser Thr Ala His His Ser Thr Thr Val Thr Gin Thr Pro 

340 345 

AGC CAC AGC TGG AGC AAC GGA CAC ACT GAA AGC ATC CTT 1543 
Ser His Ser Trp Ser Asn Gly His Thr Glu Ser He Leu 
350 355 360 

TCC GAA AGC CAC TCT GTA ATC GTG ATG TCA TCC GTA GAA 1582 
Ser Glu Ser His Ser Val He Val Met Ser Ser Val Glu ^ 
365 370 

AAC AGT AGG CAC AGC AGC CCA ACT GGG GGC CCA AGA GGA 1621 
Asn Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly 
375 380 385 

CGT CTT AAT GGC ACA GGA GGC CCT CGT GAA TGT AAC AGC 1660 
Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser 
390 395 400 

TTC CTC AGG CAT GCC AGA GAA ACC CCT GAT TCC TAC CGA 1699 
Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg 

405 410 

GAC TCT CCT CAT AGT GAA AGG TAT GTG TCA GCC ATG ACC 1738 
Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr 
415 420 425 

ACC CCG GCT CGT ATG TCA CCT GTA GAT TTC CAC ACG CCA 1777 
Thr Pr Ala Arg Met Ser Pro Val Asp Phe His Thr Pr 
430 435 

AGC TCC CCC AAA TCG CCC CCT TCG GAA ATG TCT CCA CCC 1816 
Ser Ser Pr Lys Ser Pro Pro Ser Glu Met Ser Pro Pro 
440 445 450 
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GTG TCX: AGC ATG ACG GTG TCC AAG CCT TCC ATG GCG GTC 1855 
Val Ser Ser Met Thr Val Ser Lya Pro Ser Ket Ala Val 
455 460 465 

AGC CCC TTC ATG GAA GAA GAG AGA CCT CTA CTT CTC GTG 1894 
Ser Pro Pbe Met Glu Glu Glu Arg Pro I-eu Leu Leu Val 

470 475 

ACA CCA CCA AGG CTG CGG GAG AAG AAG TTT GAC CAT CAC 1933 
Thr Pro Pro Arg Leu Arg Glu Lys Lya Phe Aap Eia Hia 
480 * 485 490 

CCT CAG CAG TTC AGC TCC TTC CAC CAC AAC CCC GCG CAT 1972 
Pro Gin Gin Phe Ser Ser Phe His His Asn Pro Ala His 
495 500 

GAC AGT AAC AGC CTC CCT GCT AGC CCC TTG AGG ATA GTG 2011 
Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg lie Val 
505 SIO 515 

GAG GAT GAG GAG TAT GAA ACG ACC CAA GAG TAC GAG CCA 2050 
Glu Asp Glu Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro 
520 525 530 

GCC CAA GAG CCT GTT AAG AAA CTC GCC AAT AGC CGG CGG 2089 
Ala Gin Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg 

535 540 

GCC AAA AGA ACC AAG CCC AAT GGC CAC ATT GCT AAC AGA 2128 
Ala Lys Arg Thr Lys Pro Asn Gly His He Ala Asn Arg* 
545 550 555 

TTG GAA GTG GAC AGC AAC ACA AGC TCC CAG AGC AGT AAC 2167 
Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn 
560 565 

TCA GAG AGT GAA ACA GAA GAT GAA AGA GTA GGT GAA GAT 2206 
Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp 
570 575 580 

ACG CCT TTC CTG GGC ATA CAG AAC CCC CTG GCA GCC AGT 2245 
Thr Pro Phe Leu Gly He Gin Asn Pro Leu Ala Ala Ser 
585 590 595 

CTT GAG GCA ACA CCT GCC TTC CGC CTG GCT GAC AGC AGG 2284 
l3eu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg 

600 605 

ACT AAC CCA GCA GGC CGC TTC TCG ACA CAG GAA GAA ATC 2323 
Thr Asn Pro Ala Gly Arg Ph Ser Thr Gin Glu Glu He 
610 615 620 

CAG GCC AGG CTG TCT AGT GTA ATT GCT AAC CAA GAC CCT 2362 
Gin Ala Arg Leu Ser Ser Val He Ala Asn Gin Asp Pro 
625 630 

FIG.-3D 

SUBSTITUTE SHEET (RULE 26) 



wo 99/39729 



PCTAJS99/02390 



14/29 



ATT GCT GTA TAAAACCTA AATAAACACA TAGATTCACC TGTAAAACTT 2410 
He Ala Val 
635 637 

TATTTTATAT AATAAAGTAT TCCACCTTAA ATTAAACAAT TTATTTTATT 2460 



TTAGCAGTTC TGCAAATAAA AAAAAAAAAA 2490 
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gcgcx:tgcct ccaacctgcg ggcgggaggt gggtggctgc ggggcaattg so 
aaaaagagcc ggcgaggagt tccccgaaac ttgttggaac tccgggctcg 100 
cgcggaggcc aggagctgag cggcggcggc tgccggacga tgggagcgtg 150 
agcaggacgg tgataacctc tccccgatcg ggttgcgagg gcgccgggca 200 
gaggccagga cgcgagccgc cavgcggcggg acccatcgac gacttcx:cgg 250 
ggcgacagga gcagccccga gagccagggc gagcgcccgt tccaggtggc 300 
cggaccgccc gccgcgtccg cgccgcgctc cctgcaggca acgggagacg 350 

CCCCCGCGCA GCGCGAGCGC CTCAGCGCGG CCGCTCGCTC TCCCCATCGA 400 

GGGACAAACT TTTCCCAAAC CCGATCCGAG CCCTTGGACC AAACTCGCCT 450 

GCGCCGAGAG CCGTCCGCGT AGAGCGCTCC GTCTCCGGCG AG ATG 495 

Met 

1 - 

TCC GAG CGC AAA GAA GGC AGA GGC AAA GGG AAG GGC AAG 534 
Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly I*ys Gly Lya 
5 10 

AAG AAG GAG CGA GGC TCC GGC AAG AAG CCG GAG TCC GCG 573 
Lys Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala 
15 20 25 

GCG GGC AGC GAG AGC CCA GCC TTG CCT CCC CAA TTG AAA €12 
Ala Gly Ser Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys 
30 35 40 

GAG ATG AAA AGC CAG GAA TCG GCT GCA GGT TCC AAA CTA 651 
Glu Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu 

45 50 

GTC CTT CGG TGT GAA ACC AGT TCT GAA TAC TCC TCT CTC 690 
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
SS 60 65 

AGA TTC AAG TGG TTC AAG AAT GGG AAT GAA TTG AAT OGA 729 
Arg Phe Lys Trp Phe Lys Asn Gly Aan Glu Leu Asn Arg 
70 ^5 
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AAA AAC AAA CCA CAA AAT ATC AAS ATA CAA AAA AAG CCA 768 
Lys Asn Lya Pro Gin Asn lie Lys lie Gin Lys Lys Pro 
80 85 90 

GGG AAG TCA GAA CTT CGC ATT AAC AAA GCA TCA CTG GCT 807 
Gly Lys Ser Glu Leu Arg lie Aan Lys Ala Ser Leu Ala 
95 100 105 

GAT TCT GGA GAG TAT ATG TGC AAA GTG ATC AGC AAA TTA 846 
Asp Ser Gly Glu Tyr Met Oys Lys Val lie Ser Lys Leu 

110 115 

GGA AAT GAC AGT GCC TCT GCC AAT ATC ACC ATC GTG GAA 885 
Gly Ash Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu 
120 125 130 

TCA AAC GAG ATC ATC ACT GGT ATG CCA GCC TCA ACT GAA 924 
Ser Asn Glu lie lie Thr Gly Met Pro Ala Ser Thr Glu 
135 140 

GGA GCA TAT GTG TCT TCA GAG TCT CCC ATT AGA ATA TCA 963 
Gly Ala Tyr Val Ser Ser Glu Ser Pro lie Arg lie Ser 
145 150 155 

GTA TCC ACA GAA GGA GCA AAT ACT TCT TCA TCT ACA TCT 1002 
Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 
160 165 170 

ACA TCC ACC ACT GGG ACA AGC CAT CTT GTA AAA TGT GCG 1041 
Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala' 

175 180 

GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGG GAG TGC 1080 
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
185 190 195 

TTC ATG GTG AAA GAC CTT TCA AAC CCC TCG AGA TAC TTG 1119 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
200 205 

TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA 1158 
Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin 
210 215 220 

AAC TAC GTA ATG GCC AGC TTC TAC AGT ACG TCC ACT CCC 1197 
Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro 
225 230 235 

TTT CTG TCT CTG CCT GAA TAGGA GCATGCTCAG TTGGTGCTGC 1240 
Phe Leu Ser Leu Pro Glu 

240 241 

TTTCTTGTTG CTGCATCTCC CCTCAGATTC CACCTAGAGC TAGATGTGTC 1290 
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TTACCAGATC TAATATTGAC TGCCTCTGCC TGTCGCATGA GAACATTAAC 1340 
AAAAGCAATT GTATTACTTC CTCTGTTCGC GACTAGTTGG CTCTGAGATA 1390 
CTAATAGGTG TGTGAGGCTC CGGATGTTTC TGGAATTGAT ATTGAATGAT 1440 
GTGATACAAA TTGATAGTCA ATATCAAGCA GTGAAATATG ATAATAAAGG 1490 
CATTTCAAAG TCTCACTTTT ATTGATAAAA TAAAAATCAT TCTACTGTIAC 1540 
AGTCCATCTT CTTTATACAA TGACCACATC CTGAAAAGGG TGTTGCTAAG 1590 
CTGTAACCGA TATGCACTTG AAATGATGGT AAGTTAATTT TGATTCAGAA 1640 
TGTGTTATTT GTCACAAATA AACATAATAA AAGGAGTTCA GATGTTTTTC 1690 
TTCATTAACC AAAAAAAAAA AAAAA 1715 
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GAGGCGCCTG CCTCCAACCT GCGGGCGGGA GGTGGGTGGC TGCGGGGCAA 50 
TT6AAAAAGA GCCGGCGAGG AGTTCCCCGA AACTTGTTGG AACTCCGGGC 100 
TCGCGCGGAG GCCAGGAGCT GAGCGGCGGC GGCTGCCGGA CGATGGGAGC 150 
GTGAGCAGGA CGGTGATAAC CTCTCCCCGA tTCGGGXTGCG AGGGCGCCGG 200 



GCAGAGGCCA GGACGCGAGC CGCCAGCGGC GGGACCCATC GACGACXTCC 250 



CGGGGCGACA GGAGCAGCCC CGAGAGCCAG GGCGAGCGCC CGTTCCAGGT 300 
GGCCGGACCG CCCGCCGCGT CCGCGCCGCG CTCCCTGCAG GCAACGGGAG 350 
ACGCCCCCGC GCAGCGCGAG CGCCTCAGCG CGGCCGCTCG CTCTCCCCAT 400 



CGAGGGACAA ACTTTTCCCA AACCCGATCC GAGCCCTTGG ACCAAACTCG 450 



CCTGCGCCGA GAGCCGTCCG CGTAGAGCGC TCCGTCTCCG GCGAG AT 497 

Met 

- 1 

G TCC GAG CGC AAA GAA GGC AGA GGC AAA GGG AAG GGC AAG 537 
Ser Glu Axg Lys Glu Gly Arg Gly Lya Gly Lyz Gly Lys 
5 10 

AAG AAG GAG CGA GGC TCC GGC AAG AAG CCG GAG TCC GCG 576 
Lys I*ys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala 
15 20 25 

GCG GGC AGC GAG AGC CCA GCC TTG CCT CCC CAA TTG AAA 615 
Ala Gly Ser Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys 
30 35 40 

GAG ATG AAA AGC CA6 GAA TCG GCT GCA GGT TCC AAA CTA 654 
GlcL Met: Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu 

45 50 

GTC CTT CGG TGT GAA ACC AGT TCT GAA TAC TCC TCT CTC 693 
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
55 60 65 

AGA TTG AAG TGG TTC AAG AAT GGG AAT GAA TTG AAT CGA 732 
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg 
70 75 
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AAA AAC AAA CCA CAA AAT ATC AA6 ATA CAA AAA AA6 CCA 771 
Lys Asn Lys Pro Gin Asn lie Lys Zle Gin Lys Lys Pro 
80 es 90 

GGG AAG TCA GAA CTT CGC ATT AAC AAA GCA TCA CTG GCT 810 
Gly Lys Ser Glu Leu Arg Zle Asn Lys Ala Ser I«eu Ala 
95 100 105 

GAT TOT GGA GAG TAT ATG TGC AAA GTG ATC AGC AAA TTA 849 
Asp Ser Gly Glu Tyr Met Cys Lys Val Zle Ser Lys Leu 

110 115 

GGA AAT GAC AGT GCC TCT GCC AAT ATC ACC ATC GTG GAA 688 
Gly Asn Asp Ser Ala Ser Ala Asn Zle Thr Zle Val Glu 
120 125 130 

TCA AAC GAG ATC ATC ACT GGT ATG CCA GCC TCA ACT GAA 927 
Ser Asn Glu Zle Zle Thr Gly Met Pro Ala Ser Thr Glu 
135 140 

GGA GCA TAT GTG TCT TCA GAG TCT CCC ATT AGA ATA TCA 966 
Gly Ala Tyr Val Ser Ser Glu Ser Pro Zle Ar<g Zle Ser 
145 150 155 

GTA TCC ACA GAA GGA GCA AAT ACT TCT TCA TCT ACA TCT 1005 
Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 
160 165 170 

ACA TCC ACC ACT GGG ACA AGC CAT CTT GTA AAA TGT GCG 1044 
Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala 

175 180 

GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGG GAG TGC 1083 
Glu Lys Glu Lys Thr Phe Cya Val Asn Gly Gly Glu Cys 
185 190 195 

TTC ATG GTG AAA GAC CTT TCA AAC CCC TCG AGA TAC TTG 1122 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg- Tyr Leu 
200 205 

TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA 1161 
Cys Lys pys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin 
210 215 220 

AAC TAC GTA ATG GCC AGC TTC TAC AAG GCG GAG GAG CTG 1200 
Asn Tyr Val Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu 
225 230 235 

TAC CAG AAG AGA GTG CTG ACC ATA ACC GGC ATC TGC ATC 1239 
Tyr Gin Lys Arg Val Leu Thr Zle Thr Gly Zl Qys Zle 

240 245 

GCC CTC CTT GTG GTC GGC ATC ATG TGT GTG GTG GCC TAC 1278 
Ala Leu Leu Val Val Gly lie Met Cys Val Val Ala Tyr 
250 255 260 
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TGC AAA ACC AAG AAA CAG CGG AAA AAG CTG CAT GAC CGT 1317 
Cya Lys Thr Lys Lys Gin Arg Lya Lys l<ou His Asp Arg 
265 270 

CTT CGG GAG AGC CTT CGG TCT GAA CGA AAC AAT ATG ATG 1356 
Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn Met Met 
275 280 285 

AAC ATT GCC AAT GGG CCT CAC CAT CCT AAC CCA CCC CCC 1395 
Asn lie Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 
290 295 300 

GAG AAT GTC CAG CTG GTG AAT CAA TAC GTA TCT AAA AAC 1434 
Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn 

305 310 

GTC ATC TCC AGT GAG CAT ATT GTT GAG AGA GAA GCA GAG 1473 
Val lie Ser Ser Glu His lie Val Glu Arg Glu Ala Glu 
315 320 325 

ACA TCC TTT TCC ACC AGT CAC TAT ACT TCC ACA GCC CAT 1512 
Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His 
330 335 

CAC TCC ACT ACT GTC ACC CAG ACT CCT AGC CAC AGC TGG 1551 
His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser Trp 
340 345 350 

AGC AAC GGA CAC ACT GAA AGC ATC CTT TCC GAA AGC CAC 1590 
Ser Asn Gly His Thr Glu Ser He Leu Ser Glu Ser His 
355 360 365 

TCT GTA ATC GTG ATG TCA TCC GTA GAA AAC AGT AGG CAC 1629 
Ser Val He Val Met Ser Ser Val Glu Asn Ser Arg His 

370 375 

AGC AGC CCA ACT GGG GGC CCA AGA GGA CGT CTT AAT GGC 1668 
Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly 
380 385 390 

ACA GGA GGC CCT CGT GAA TGT AAC AGC TTC CTC AGG CAT 1707 
Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His 
395 400 

GCC AGA GAA ACC CCT GAT TCC TAC CGA GAC TCT CCT CAT 1746 
Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His 
405 410 415 

AGT GAA AGG TAAAA CCGAAGGCAA AGCTACTGCA GAGGAGAAAC 1790 
Ser Glu Arg 
420 
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TCAGTCAGAG AATCCCTGTG AGCACCTGCG GTCTCACCTC AGGAAATCTA 1840 
CTCTAATCAG AATAAGGGGC GGCAGTTACC TGTTCTAGGA GTGCTCCTAG 1890 
TTGATGAAGT CATCTCTTTG TTTGACGGAA CTTATTTCTT CTGAGCTTCT 1940 
CTCGTCGTCC CAGTGACTGA CAGGCAACAG ACTCTTAAAG AGCTGGGATG 1990 
CTTTGATGCG GAAGGTGCAG CACATGGAGT TTCCAGCTCT GGCCATGGGC 2040 
TCAGACCCAC TCGGGGTCTC AGTGTCCTCA GTTGTAACAT TAGAGAGATG 2090 
GCATCAATGC TTGATAAGGA CCCTTCTATA ATTCCAATTG CCAGTTATCC 2140 
AAACTCTGAT TCGGTGGTCG AGCTGGCCTC GTGTTCTTAT CTGCTAACCC 2190 
TGTCTTACCT TCCAGCCTCA GTTAAGTCAA ATCAAGGGCT ATGTCATTGC 2240 
TGAATGTCAT GGGGGGCAAC TGCTTGCCCT CCACCCTATA GTATCTATTT 2290 
TATGAAATTC CAAGAAGGGA TGAATAAATA AATCTCTTGG ATGCTGCGTC 2340 
TGGCAGTCTT CACGGGTGGT TTTCAAAGCA GAA21AAAAAA AAAAAAAAAA 2390 
AAAAAA2AAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 2431 
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1 GGGTACCATGGGTCGGTGAGCGCGTTTCCCGCCTGAGCGCAACTAGCGGC 

51 GGGTCGTGGGCACCTCCAGAAAAGATCCCGCACCATCCTCCAGGATCCAA 

101 TGGCCTTGGAGAGAGGGCTGCAGGGCCCACGGACATTGCTGACTCTTCAG 

151 AACGTGCTGACATGGAGCCAGGTAGACTGAAATTATCATGTGTCCAAATT 

2 01 AAAATTGCATACTTCAAGGATTATTTGAAGGACTATTCTTAGACCCTTTT 
251 AAGAAGATTTAAAGAAAAACCACTCGGCCCTGAGTGCGGCGAGGACCCTG 

3 01 TTTGTGGATGTGGAGGAGCGCGGGCCGGAGGCCATGGACGTGAAGGAGAG 

M D V K E R 

3 51 GAAGCCTTACCGCTCGCTGACCCGGCGCCGCGACGCCGAGCGCCGCTACA 

7 KPYRSLTRRRDAERRYT 

4 01 CCAGCTCGTCCGCGGACAGCGAGGAGGGCAAAGCCCCGCAGAAATCGTAC 
24 SSSADSEEGKAPQKSY 

451 AGCTCCAGCGAGACCCTGAAGGCCTACGACCAGGACGCCCGCCTAGCCTA 
40S S SETLKAYDQ DARLAY 

5 01 TGGC AGCCGCGTCAAGGACATTGTGCCGCAGGAGGCCGAGGAATTCTGCC 
57 GSRV KDIVPQEAEEFCR 

5 51 GC ACAGGTGCCAACTTCACCCTGCGGGAGCTGGGGCTGGAAGAAGTAACG 
74 TGANFTLRELGIiEEVT 

eOl CCCCCTCACGGGACCCTGTACCGGACAGACATTGGCCTCCCCCACTGCGG 
90 p PHGTLYRTDIGI/PHCG 

651 CTACTCCATGGGGGCTGGCTCTGATGCCGACATGGAGGCTGACACGGTGC 
107 Y SMGAGSDADMEADTVL 

7 01 TGTCCCCTGAGCACCCCGTGCGTCTGTG6GGCCGGAGCACACGGTCAGGG 
124 S P EH P VRL WG R S T R S G 

751 CGCAGCTCCTGCCTGTCCAGCCGGGCCAATTCCAATCTCACACTC ACCGA 
14 0RS SCIi SSRANS NLTI^TD 

8 01 CACCGAGCATGAAAACACTGAGACTGATCATCCGGGCGGCCTGCAGAACC 
157 TEHENTET DHPGGLQNH 

8 51 ACGCGCGGCTCCGGACGCCGCCGCCGCCGCTCTCGCACGCCCACACCCCC 
174 ARZ-RTPPPPIiSHAHTP 

9 01 AACCAGCACCACGCGGCCTCCATTAACTCCCTGAACCGGGGCAACTTCAC 
19 ON Q H HAA S INS I- N R G N F T 

9 51 GCCGAGGAGCAACCCCAGCCCGGCCCCCACGGACCACTCGCTCTCCGGAG 
207 pRSNPSPAPTDHSIiSGE 

10 01 AGCCCCCTGCCGGCGGCGCCCAGGAGCCTGCCCACGCCCAGGAGAACTGG 
224 P PAGGAQEPAHAQ ENW 

1051 CTGCTCAACAGCAACATCCCCCTGGAGACCAGAAACCTAGGCAAGCAGCC 
240 IiI^N SNIPliETRNIiGKOP 

FIG..7A 
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1101 ATTCCTAGGGACATTGCAGGACAACCTCATTGAGATGGACATTCTCGGCG 
2 57 FIiGTIiQDNi:*! E M D ILGA 

1151 CCTCCCGCCATGATGGGGCTTACAGTGACGGGCACTTCCTCTTCAAGCCT 
274 SRHDGA YS DGHFLFKP 

1201 GGAGGCACCTCCCCGCTCTTCTGCACCACATCACCAGGGTACCCACTGAC 
290 GG TSPLFCTTSPGYPIiT 

1251 GTCCAGCACAGTGTACTCTCCTCCGCCCCGACCCCTGCCCCGCAGCACCT 
307 sSTVYSP PPRPIi P RSTF 

13 01 TCGCCCGGCCGGCCTTTAACCTCAAGAAGCCCTCCAAGTACTGTAACTGG 
324 ARPAFNIiKKP S KYCNW 

13 51 AAGTGCGCAGCCCTGAGCGCCATCGTCATCTCAGCCACTCTGGTCATCCT 
340 K C AALSAIVISATLVIL 

14 01 GCTGGCATACTTTGTGGCCATGCACCTGTTTGGCCTAAACTGGCACCTGC 
357 L A Y F V A M HLFGIiNWHLQ 

14 51 AGCCGATGGAGGGGCAGATGTATGAGATCACGGAGGACACAGCCAGCAGT 
374 PME GQMYE I T E D TA S S 

15 01 TGGCCTGTGCCAACCGACGTCTCCCTATACCCCTCAGGGGGCACTGGCTT 
390W P V P T D V S L Y P S G G T G L 

15 5 1 AGAGACCCCTGACAGGAAAGGCAAAGGAACCACAGAAGGAAAGCCCAGTA 
407 ETPDRKGKG T T E GK^ SS 

16 01 GTTTCTTTCCAGAGGAC AGTTTCATAGATTCTGGAGAAATTGATGTGGGA 
424 FFPE DSFIDSGE IDVG 

1651 AGGCGAGCTTCCCAGAAGATTCCTCCTGGCACTTTCTGGAGATCTCAAGT 
440 RRASQKIPP G TFWRSQV 

17 01 GTTCATAGACCATCCTGTGCATCTGAAATTCAATGTGTCTCTGGGAAAGG 
457 FIDHPVHLKFNV SLGKA 

17 5 1 CAGCCCTGGTTGGCATTTATGGCAGAAAAGGCCTCCCTCCTTCACATACA 
474 ALVGIYGRKGIiP PSHT 

1801 CAGTTTGACTTTGTGGAGCTGCTGGATGGCAGGAGGCTCCTAACCCAGGA 
490 QFDFVELLDGRRIiIiTQE 

1851 GGCGCGGAGCCTAGAGGGGACCCCGCGCCAGTCTCGGGGAACTGTGCCCC 
507 ARSIiEGTPRQS R GTVPP 

19 01 CCTCCAGCCATGAGACAGGCTTCATCCAGTATTTGGATTCAGGAATCTGG 
524 SSHETGFIQYIiD S6IW 

1951 CACTTGGCTTTTTACAATGACGGAAAGGAGTCAGAAGTGGTTTCCTTTCT 
540 HLAFY NDGKESEVVSFL 

F/G._7B 
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9 n m CACCACTGCCATTGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGG 
SS7 T TAIAL PPRLKEMKSQE 

2 051 AATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAA 
574 SAAGSKLVLRCETSSE 

2101 TACTCCTCTCTCAGATTCAAGTGGTTCAAGAATGGGAATGAATTGAATCG 
S9 0YS SLRFK WFKNGNELN R 

2151 AAAAAACAAACCACAAAATATCAAGATACAAAAAAAGCCAGGGAAGTCAG 
607 KNKPQNIKIQKKPGKSE 

2201 AACTTCGCATTAACAAAGC ATCACTGGCTGATTCTGGAGAGTATATGTGC 
624 LRINKASLADSGEYMC 

2251 AAAGTGATCAGCAAATTAGGAAATGACAGTGCCTCTGCCAATATCACCAT 
640 KVISKLGNDSASANITI 

23 01 CGTGGAATCAAACGAGATC ATCACTGGTATGCC AGCCTCAACTGAAGGAG 
657 VESNEIITGMPASTEGA 

2 3 51 CATATGTGTCTTCAGAGTCTCCCATTAGAATATCAGTATCCACAGAAGGA 
674 YVSSESPIRISVSTEG 



690 ANTS SSTS TSTTGT ^mmmm 



7 07 

2 501 GCTTCATGGTGAAAGACCTTTC AAACCCCTCGAGATACT^^^ 

724 pisiiiiiMiiMilii^^ 

2 551 CCAAATGAGTTTACTGGTGATCGCTGCCAAAACTACGTAATGGCCAGCTT 
7i0 QNYVMASF 

2 601 CT AC AGT ACGTCCACTCCCTTTCTGTCTCTGCCTGAAT AGGAGCATGCTC 
757 YSTSTPFIiSLP E 

2 65 1 AGTTGGTGCTGCTTTCTTGTTGCTGCATCTCCCCTCAGATTCCACCTAGA 

2701 GCTAGATGTGTCTTACCAGATCTAATATTGACTGCCTCTGCCTGTCGC AT 
27 51 GAGAACATTAACAAAAGCAATTGTATTACTTCCTCTGTTCGCGACTAGTT 
2 801 GGCTCTGAGATACTAATAGGTGTGTGAGGCTCCGGATGTTTCTGGAATTG 
2851 ATATTGAATGATGTGATACAAATTGATAGTCAATATCAAGCAGTGAAATA 

— jj^Tj^TAAAGGCATTTCAAAGTCTCACTTTTATTGATAAAATAAAAATC 

PTCTACTGAACAGTCCATCTTCTTTATACAAT6ACCACATCCTGAAAAG 



2901 TGATAATAAAGGCATTTCAAAGTCTCACTTTTATTGATAAAATAAAAATC 

2951 ATTCTACTGAACAGTCCATCTTCTTTATACAAT6ACCACATCCTGAAAAG 

3 001 GGTGTTGCTAAGCTGTAACCGATATGCACTTGAAATGATGGTAAGTTAAT 

3 051 TTTGATTCAGAATGTGTTATTTGTCACAAATAAACATAATAAAAGGAAAA 

3101 AAAAAAAAAAA 
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Sequence Listing 

<110> Genentech, Inc. 

<120> Use of Heregulin as a Growth Factor 

<130> P1145PCT 

5 <141> 1999-02-03 

<150> US 09/020,598 
<151> 1998-02-04 

<160> 14 

<210> 1 
10 <211> 669 
<212> PRT 
<213> Homo sapiens 

<400> 1 

. Ala Arg Ala Pro Gin Arg Gly Arg Ser Leu Ser Pro Ser Arg Asp 
15 1 5 10 15 

Lys Leu Phe Pro Asn Pro lie Arg Ala Leu Gly Pro Asn Ser Pro 
20 25 30 

Ala Pro Arg Ala Val Arg Val Glu Arg Ser Val Ser Gly Glu Met 
35 40 45 

20 Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys , 

50 55 60 

Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin 
65 70 75 

Ser Pro Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 
25 80 85 90 

Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser 
95 100 105 

Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu 
110 115 120 

30 Leu Asn T^g Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys 

125 130 135 

Pro Gly Lys Ser Glu Leu Arg He Asn Lys Ala Ser Leu Ala Asp 
140 145 150 

Ser Gly Glu Tyr Met Cys Lys Val He Ser Lys Leu Gly Asn Asp 
35 155 160 165 

Ser TQa Ser Ala Asn He Thr He Val Glu Ser Asn Glu He He 
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170 175 180 

Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu 
185 190 195 

Ser Pro lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr Ser 
5 200 205 210 

Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys 
215 220 225 

Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
230 235 240 

10 Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys 

245 250 255 

Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro 
260 265 270 

Met Lys Val Gin Asn Gin Glu Lys Ala Glu Glu Leu Tyr Gin Lys 
15 275 280 285 

Arg Val Leu Thr He Thr Gly He Cys He Ala Leu Leu Val Val 
290 295 300 

Gly He Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg 
305 310 315 

20 Lys Lys Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg 

320 325 330 

Asn Asn Met Met Asn He Ala Asn Gly Pro His His Pro Asn Pro 
335 340 345 

Pro Pro Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn 
25 350 355 360 

Val He Ser Ser Glu His He Val Glu Arg Glu Ala Glu Thr Ser 
365 370 375 

Phe Ser Thr Ser His Tyr Thr Ser thr Ala His His Ser Thr Thr 
380 385 390 

30 Val Thr Gin Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu 

395 400 405 

Ser He Leu Ser Glu Ser His Ser Val He Val Met Ser Ser Val 
410 415 420 

Glu Asn Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg 
35 425 430 435 

Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg 
440 445 450 
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His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser 
455 460 465 

Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala Airg Met Ser Pro 
470 475 480 

5 Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu 

485 490 495 

Met Ser Pro Pro Val Ser Ser Met Thr Val Ser Met Pro Ser Met 
500 505 510 

Ala Val Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val 
10 515 520 525 

Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gin 
530 535 540 

Gin Phe Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn Ser 
545 550 555 

15 Leu Pro Ala Ser Pro Leu Arg lie Val Glu Asp Glu Glu Tyr Glu 

560 565 570 

Thr Thr Gin Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys Lys Leu 
575 580 585 

Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His lie 
20 590 595 600 

Ala Asn T^g Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser 
605 610 615 

Asn Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr 
620 625 630 

25 Pro Phe Leu Gly lie Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala 

635 640 645 

Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly 
650 655 660 

Arg Phe Ser Thr Gin Glu Glu lie Gin 
30 665 669 

<210> 2 
<211> 2226 
<212> DNA 

<213> Homo sapiens 
35 <400> 2 

99 gcg cga gcg cct cag cgc ggc cgc teg etc tec ccc 38 
Ala Arg Ala Pro Gin Arg Gly Arg Ser Leu Ser Pro 
1 5 10 

teg agg gac aaa ctt ttc eca aae ccg ate ega gee ctt 77 
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Ser Arg Asp Lys I*eu Phe Pro Asn Pro lie Arg Ala Leu 
15 20 25 

gga cca aac teg cct gcg ccg aga gcc gtc cgc gta gag 116 
Gly Pro Asn Ser Pro Ala Pro Arg Ala Val Arg Val Glu 
5 30 35 

cgc tec gtc tec ggc gag atg tec gag cgc aaa gaa ggc 155 
Arg Ser Val Ser Gly Glu Met Ser Glu Arg Lys Glu Gly 
40 45 50 

aga ggc aaa ggg aag ggc aag aag aag gag ega ggc tec 194 
10 Arg Gly Lys Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser 

55 60 

ggc aag aag ccg gag tee gcg gcg ggc age cag age cca 233 
Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro 
65 70 75 

15 gcc ttg cct cec cga ttg aaa gag atg aaa age cag gaa 2 72 
Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 
80 85 90 

teg get gca ggt tec aaa eta gtc ctt egg tgt gaa acc 311 
Ser Ala Ala Gly Ser I^ys Leu Val Leu Arg Cys Glu Thr 
20 95 100 

agt tct gaa tac tec tct etc aga ttc aag tgg ttc aag 3 50 
Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys 
105 110 115 

aat ggg aat gaa ttg aat cga aaa aac aaa cca caa aat 38 9 
25 Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn 

120 125 

ate aag ata caa aaa aag cca ggg aag tea gaa ctt cgc 428 
lie Lys lie Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg 
130 135 140 

30 att aac aaa gca tea ctg get gat tct gga gag tat atg 467 
lie Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met 
145 150 155 

tgc aaa gtg ate age aaa tta gga aat gac agt gee tct 506 
Cys Lys Val lie Ser Lys Leu Gly Asn Asp Ser Ala Ser 
35 160 165 

gcc aat ate acc ate gtg gaa tea aac gag ate ate act 545 
Ala Asn He Thr He Val Glu Ser Asn Glu He He Thr 
170 175 180 

ggt atg cca gcc tea act gaa gga gca tat gtg tct tea 584 
4 0 Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 

185 190 

gag tct cec att aga ata tea gta tee aca gaa gga gca 623 
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Glu Ser Pro lie Arg lie Ser Val Ser Thr Glu Gly TVla 
195 200 205 

aat act tct tea tct aca tct aca tec ace act ggg aca 662 
Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr 
5 210 215 220 

age cat ett gta aaa tgt gcg gag aag gag aaa act ttc 701 
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe 
225 230 

tgt gtg aat gga ggg gag tgc ttc atg gtg aaa gad ctt 74 0 
10 Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu 
235 240 245 

tea aac ccc teg aga tac ttg tgc aag tgc caa cct gga 779 
Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Gin Pro Gly 
250 255 

15 ttc act gga gca aga tgt act gag aat gtg ccc atg aaa 818 
Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro Met Lys 
260 265 270 

gtc caa aac caa gaa aag gcg gag gag ctg tac cag aag 857 
Val Gin Asn Gin Glu Lys Ala Glu Glu Leu Tyr Gin Lys 
20 275 280 285 

aga gtg ctg ace ata ace ggc ate tgc ate gee etc ctt 896 
Arg Val Leu Thr lie Thr Gly lie Cys lie Ala Leu Leu 
290 295 

gtg gtc ggc ate atg tgt gtg gtg gee tac tgc aaa ace 935 
25 Val Val Gly lie Met Cys Val Val Ala Tyr Cys Lys Thr 
300 305 310 

aag aaa cag egg aaa aag ctg cat gac cgt ctt egg cag 974 
Lys Lys Gin Arg Lys Lys Leu His Asp Arg Leu Arg Gin 
315 320 

30 age ctt egg tct gaa ega aac aat atg atg aac att gee 1013 
Ser Leu Arg Ser Glu Arg Asn Asn Met Met Asn lie JVla 
325 330 335 

aat ggg cct cac cat cct aac cca ccc ccc gag aat gtc 1052 
Asn Gly Pro His His Pro Asn Pro Pro Pro Glu Asn Val 
35 340 345 350 

cag ctg gtg aat caa tac gta tct aaa aac gtc ate tec 1091 
Gin Leu Val Asn Gin Tyr Val Ser Lys Asn Val lie Ser 
355 360 

agt gag cat att gtt gag aga gaa gca gag aca tec ttt 1130 
40 Ser Glu His lie Val Glu Arg Glu Ala Glu Thr Ser Phe 
365 370 375 

tec acc agt cac tat act tec aca gee cat cac tec act 1169 
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Ser Thr 



act gtc 
Thr Val 
5 390 

cac act 
His Thr 



gtg atg 
10 Val Met 



act ggg 
Thr Gly 
430 

15 cct cgt 
Pro Arg 



acc cct 
Thr Pro 
20 455 

tat gtg 
Tyr Val 



gta gat 
25 Val Asp 



teg gaa 
Ser Glu 
495 

30 atg cct 
Met Pro 



aga cct 
Arg Pro 
35 520 

aag aag 
Lys Lys 



cac cac 
40 His His 



age ccc 



Ser His Tyr 
380 

acc cag act 
Thr Gin Thr 



gaa age ate 
Glu Ser lie 
405 

tea tec gta 
Ser Ser Val 
420 

ggc cea aga 
Gly Pro Arg 



gaa tgt aac 
Glu Cys Asn 
445 

gat tec tac 
Asp Ser Tyr 



tea gee atg 
Ser Ala Met 
470 

ttc cac aeg 
Phe His Thr 
485 

atg tct cca 
Met Ser Pro 



tec atg gcg 
Ser Met Ala 
510 

eta ctt etc 
Leu lieu Leu 



ttt gae cat 
Phe Asp His 
535 

aac ccc gcg 
Asn Pro Ala 
550 

ttg agg ata 



Thr Ser Thr 



cct age cac 
Pro Ser His 
395 

ctt tec gaa 
Leu Ser Glu 
410 

gaa aac agt 
Glu Asn Ser 



gga cgt ctt 

Gly Arg Leu 
435 

age ttc etc 

Ser Ph.e Leu 



ega gae tct 
Arg Asp Ser 
460 

acc ace ecg 
Thr Thr Pro 
475 

cca age tec 
Pro Ser Ser 



ccc gtg tec 
Pro Val Ser 
500 

gtc age ccc 
Val Ser Pro 



gtg aca cca 
Val Thr Pro 
525 

cac cct cag 
His Pro Gin 
540 

cat gae agt 
His Asp Ser 



gtg gag gat 



Ala His His 
385 

age tgg age 
Ser Trp Ser 
400 

age cac tct 
Ser His Ser 



agg cac age 
Arg His Ser 
425 

aat ggc aca 
Asn Gly Thr 



agg eat gee 
Arg His Ala 
450 

cet eat agt 
Pro His Ser 
465 

get cgt atg 
Ala Arg Met 



ccc aaa teg 
Pro Lys Ser 
490 

age atg a eg 
Ser Met Thr 



ttc atg gaa 
Phe Met Glu 
515 

cca agg ctg 
Pro Arg Leu 
530 

cag ttc age 
Gin Phe Ser 



aac age etc 
Asn Ser Leu 
555 

gag gag tat 



Ser Thr 



aac gga 1208 
Asn Gly 



gta ate 1247 
Val lie 
415 

age cca 1286 
Ser Pro 



gga ggc 1325 
Gly Gly 
440 

aga gaa 1364 
Arg Glu 



gaa agg 14 03 
Glu Arg 



tea cct 1442 
Ser Pro 
480 

ccc cct 1481 
Pro Pro 



gtg tec 1520 
Val Ser 
505 

gaa gag 1559 
Glu Glu 



egg gag i598 

Arg Glu 



tec ttc 1637 
Ser Phe 
545 

cet get 1676 
Pro Ala 



gaa acg 1715 
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Ser Pro Leu Arg He Val Glu Asp Glu Glu Tyr Glu Thr 
560 565 570 

acc caa gag tac gag cca gcc caa gag cct gtt aag aaa 1754 
Thr Gin Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys Lys 
5 575 580 

etc gcc aat age egg egg gcc aaa aga aec aag cce aat 1793 
Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn 
585 590 595 

ggc cac att get aac aga ttg gaa gtg gac age aac aca 1832 
10 Gly His He Ala Asn Arg Leu Glu Val Asp Ser Asn Thr 
600 605 610 

age tec cag age agt aac tea gag agt gaa aca gaa gat 1871 
Ser Ser Gin Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp 
615 620 

15 gaa aga gta ggt gaa gat acg cct ttc ctg ggc ata cag 1910 
Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Gly He Gin 
625 630 635 

aac cce ctg gca gcc agt ctt gag gca aca cct gcc ttc 1949 
Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe 
20 640 645 

egc ctg get gac age agg act aac cea gca ggc cgc ttc 1988 
Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe 
650 655 660 

teg aca cag gaa gaa ate cag g ccaggctgtc tagtgtaatt 2 03 0 
25 Ser Thr Gin Glu Glu He Gin 
665 669 

gctaaccaag accctattgc tgtataaaac ctaaataaac acatagattc 2080 

acctgtaaaa ctttatttta tataataaag tattccacct taaattaaac 213 0 

aatttatttt attttagcag ttctgcaaat agaaaacagg aaaaaaactt 2180 

30 ttataaatta aatatatgta tgtaaaaatg aaaaaaaaaa aaaaaa 2226 

<210> 3 

<211> 675 

<212> PRT 

<213> Homo sapiens 

35 <400> 3 

Asp Lys Leu Phe Pro Asn Pro He Arg Ala I^eu Gly Pro Asn Ser 
1 5 10 15 

Pro Ala Pro Arg Ala Val Arg Val Glu Arg Ser Val Ser Gly Glu 
20 25 30 

40 Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 
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35 40 45 

Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 

50 55 60 

Gin Ser Pro Ala lieu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 

5 65 70 75 

Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 

80 85 90 

Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 

95 100 105 

10 Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys 

110 115 120 

Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 

125 130 135 

Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu Gly Asn 

15 140 145 150 

Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn Glu lie 

155 160 165 

lie Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 

170 175 180 

20 Glu Ser Pro lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr 

185 190 195 

Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser Hxs Leu Val 

200 205 23.0 

Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 

25 215 220 225 

Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys 

230 235 240 

Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val 

245 250 255 

30 Met Ala Ser Phe Tyr Lys His Leu Gly lie Glu Phe Met Glu Ala 

260 265 270 

Glu Glu Leu Tyr Gin Lys Arg Val Leu Thr lie Thr Gly lie Cys 

275 280 285 

lie Ala Leu Leu Val Val Gly lie Met Cys Val Val Ala Tyr Cys 

35 290 295 300 

Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp Arg Leu Arg Gin 

305 310 315 
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Ser Leu Arg Ser Glu Arg Asn Asn Met Met Asn lie Ala Asn Gly 

320 325 330 

Pro His His Pro Asn Pro Pro Pro Glu Asn Val Gin Leu Val Asn 

335 340 345 

5 Gin Tyr Val Ser Lys Asn Val lie Ser Ser Glu His lie Val Glu 

350 355 360 

Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr 

365 370 375 

Ala His His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser Trp 

10 380 385 390 

Ser Asn Gly His Thr Glu Ser lie Leu Ser Glu Ser His Ser Val 

395 400 405 

lie Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro Thr 

410 415 420 

15 Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu 

425 430 435 

Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr 

440 445 450 

Arg Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr 

20 455 460 465 

Pro Ala Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro 

470 475 480 

Lys Ser Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser Met Thr 

485 490 495 

25 Val Ser Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu 

500 505 510 

Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu Lys Lys 

515 520 525 

Phe Asp His His Pro Gin Gin Phe Ser Ser Phe His His Asn Pro 

30 530 535 540 

Ala His Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg lie Val 

545 550 555 

Glu Asp Glu Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin 

560 565 570 

35 Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr 

575 580 585 

Lys Pro Asn Gly His lie Ala TVsn Arg Leu Glu Val Asp Ser Asn 

590 595 600 
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Thr Ser Ser Gin Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp Glu 
605 6X0 615 

Arg Val Gly Gl^ Asp Thr Pro Phe Leu Gly lie Gin Asn Pro Leu 
620 625 630 

5 Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser 

635 640 645 

Arg Thr Asn Pro Ala Gly Arg Phe Ser Thr Gin Glu Glu lie Gin 
650 655 660 

Ala Arg Leu Ser Ser Val lie Ala Asn Gin Asp Pro lie Ala Val 
10 665 670 675 

<210> 4 

<211> 2199 

<212> DNA 

<213> Homo sapiens 

15 <400> 4 

gg gac aaa ctt ttc cca aac ccg ate cga gcc ctt gga 38 
Asp Lys Leu Phe Pro Asn Pro lie Arg Ala Leu Gly 
15 10 

cca aac teg cct gcg ccg aga gcc gtc cgc gta gag cgc 7 7 
2 0 Pro Asn Ser Pro Ala Pro Arg Ala Val Arg Val Glu Arg 
15 20 25 

tec gtc tec gge gag atg tec gag cgc aaa gaa ggc aga 116 
Ser Val Ser Gly Glu Met Ser Glu Arg Lys Glu Gly Arg 
30 35 

25 ggc aaa ggg aag ggc aag aag aag gag cga ggc tec ggc 155 
Gly Lys Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly 
40 45 50 

aag aag ccg gag tec gcg gcg ggc age cag age cca gcc 194 
Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala 
30 55 60 

ttg cct ccc caa ttg aaa gag atg aaa age cag gaa teg 233 
Ijeu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser 
65 70 75 

get gea ggt tec aaa eta gtc ctt egg tgt gaa acc agt 272 
35 Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 
80 85 90 

tet gaa tac tee tct etc aga ttc aag tgg ttc aag aat 311 
Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn 
95 100 

40 ggg aat gaa ttg aat cga aaa aac aaa cca caa aat ate 350 
Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie 
105 110 115 
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aag ata caa aaa aag cca ggg aag tea gaa ctt cgc att 389 
Lys lie Gin Lys Lys Pro Gly Lys Ser Glu lieu Arg lie 
120 125 

aac aaa gca tea ctg get gat tct gga gag tat atg tgc 42 8 
5 Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys 
130 135 140 

aaa gtg ate age aaa tta gga aat gac agt gcc tct gee 467 
Lys Val lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala 
145 150 155 

10 aat ate ace ate gtg gaa tea aac gag ate ate act ggt 506 
Asn lie Thr lie Val Glu Ser Asn Glu He He Thr Gly 
160 165 

atg cca gee tea act gaa gga gca tat gtg tct tea gag 54 5 
Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu 
15 170 175 180 

tct ece att aga ata tea gta tec aca gaa gga gca aat 584 
Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn 
185 190 

act tct tea tct aca tct aca tec ace act ggg aca age 623 
20 Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser 
195 200 205 

eat ett gta aaa tgt geg gag aag gag aaa act tte tgt 662 
His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys 
210 215 220 

25 gtg aat gga ggg gag tgc tte atg gtg aaa gac ett tea 701 
Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser 
225 230 

aac cec teg aga tac ttg tgc aag tgc cca aat gag ttt 740 
Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe 
30 235 240 245 

act ggt gat cgc tgc caa aac tac gta atg gee age tte 779 
Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe 
250 255 

tac aag cat ett ggg att gaa ttt atg gag geg gag gag 818 
35 Tyr Lys His Leu Gly He Glu Phe Met Glu Ala Glu Glu 
260 265 270 

ctg tac cag aag aga gtg ctg ace ata ace ggc ate tgc 857 
Leu Tyr Gin Lys Arg Val Leu Thr He Thr Gly He Cys 
275 280 285 

40 ate gee etc ett gtg gtc ggc ate atg tgt gtg gtg gcc 896 
He Ala Leu Leu Val Val Gly He Met Cys Val Val Ala 
290 295 
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tac tgc aaa acc aag aaa cag egg aaa aag ctg cat gac 
Tyx Cys Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp 
300 305 310 

cgt ctt egg cag age ctt egg tct gaa ega aac aat atg 
5 Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn Met 
315 320 

atg aac att gee aat ggg cct cac cat cct aac cca ecc 
Met Asn lie Ala Asn Gly Pro His His Pro Asn Pro Pro 
325 330 335 

10 ccc gag aat gtc cag ctg gtg aat caa tac gta tct aaa 
Pro Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys 
340 345 350 

aac gtc ate tec agt gag cat att gtt gag aga gaa gca 
Asn Val He Ser Ser Glu His He Val Glu Arg Glu Ala 
15 355 360 

gag aca tec ttt tec ace agt cac tat act tec aca gee 
Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala 
365 370 375 

cat cac tec act act gtc ace cag act cct age cac age 
20 His His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser 

380 385 

tgg age aac gga cac act gaa age ate ctt tec gaa age 
Trp Ser Asn Gly His Thr Glu Ser He Leu Ser Glu Ser 
390 395 400 

25 cac tct gta ate gtg atg tea tec gta gaa aac agt agg 
His Ser Val He Val Met Ser Ser Val Glu Asn Ser Arg 
405 410 415 

cac age age cca act ggg ggc cca aga gga cgt ctt aat 
His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg I^eu Asn 
30 420 425 

ggc aca gga ggc cct cgt gaa tgt aac age ttc etc agg 
Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg 
430 435 440 

cat gcc aga gaa ace cct gat tec tac ega gac tct cct 
35 His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro 

445 450 

cat agt gaa agg tat gtg tea gcc atg ace acc ccg get 
His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala 
455 460 465 

40 cgt atg tea cct gta gat ttc cac acg cca age tec ccc 
Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro 
470 475 480 
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aaa teg ccc cct teg gaa atg tct cca ccc gtg tec age 1481 
Lys Ser Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser 
485 490 

atg acg gtg tec atg cct tec atg gcg gtc age ccc ttc 152 0 
5 Met Thr Val Ser Met Pro Ser Met Ala Val Ser Pro Phe 
495 500 505 

atg gaa gaa gag aga cet eta ctt etc gtg aca eea cca 1559 
Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro Pro 
510 515 

10 agg etg egg gag aag aag ttt gae cat cac cct cag cag 1598 
Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gin Gin 
520 525 530 

ttc age tec ttc cae eac aac ccc gcg cat gae agt aac 1637 
Phe Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn 
15 535 540 545 

age etc cet get age ccc ttg agg ata gtg gag gat gag 1676 
Ser Leu Pro Ala Ser Pro Leu Arg lie Val Glu Asp Glu 
550 555 

gag tat gaa acg ace caa gag tae gag cca gee caa gag 1715 
20 Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin Glu 
560 565 570 

cet gtt aag aaa etc gee aat age egg egg gee aaa aga 1754 
Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg 
575 580 

25 ace aag ccc aat ggc cac att get aac aga ttg gaa gtg 1793 
Thr Lys Pro Asn Gly His lie Ala Asn Arg Leu Glu Val 
585 590 595 

gae age aac aca age tec cag age agt aac tea gag agt 1832 
Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn Ser Glu Ser 
30 600 605 610 

gaa aca gaa gat gaa aga gta ggt gaa gat acg cct ttc 1871 
Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe 
615 620 

ctg ggc ata cag aac ccc ctg gca gee agt ctt gag gca 1910 
35 Leu Gly lie Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala 
625 630 635 

aca cct gee ttc cgc ctg get gae age agg act aac cca 194 9 
Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro 
640 645 

40 'gca ggc cgc ttc teg aca cag gaa gaa ate cag gee agg 1988 
Ala Gly Arg Phe Ser Thr Gin Glu Glu lie Gin Ala Arg 
650 655 660 
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ctg tct agt gta att get aac caa gac cct att get gta ta 2029 
Leu Ser Ser Val lie Ala Asn Gin Asp Pro lie Ala Val 
665 670 675 

a aacctaaata aacacataga ttcacctgta aaactttatt 2070 

5 ttatataata aagtattcca ccttaaatta aacaatttat tttattttag 2120 

cagttctgca aatagaaaac aggaaaaaaa cttttataaa ttaaatatat 2170 

gtatgtaaaa atgaaaaaaa aaaaaaaaa 2199 

<210> 5 

<211> 637 

10 <212> PRT 

<213> Homo sapiens 

<400> 5 

Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 
15 10 15 

15 Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 

20 25 30 

Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 
35 40 45 

Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 
20 50 55 60 

Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 

65 70 75 - 

Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys 
80 85 90 

25 Lys Pro Gly Lys Ser Glu Leu Arg He Asn Lys Ala Ser Leu Ala 

95 100 105 

Asp Ser Gly Glu Tyr Met Cys Lys Val He Ser Lys Leu Gly Asn 
110 115 120 

Asp Ser Ala Ser Ala Asn He Thr He Val Glu Ser Asn Glu He 
30 125 130 135 

He Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 
140 145 150 

Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr 
155 160 165 

35 Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 

170 175 180 

Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 
185 190 195 
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Cys Phe Met Val Lys Asp Leu Sex Asn Pro Ser Arg Tyr Leu Cys 
200 205 2X0 

Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val 
215 220 225 

5 Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu Tyr Gin Lys Arg Val 

230 235 240 

Leu Thr lie Thr Gly lie Cys lie Ala Leu Leu Val Val Gly lie 
245 250 255 

Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys 
10 260 265 270 

Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn 
275 280 285 

Met Met Asn lie Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 
290 295 300 

15 Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn Val He 

305 310 315 

Ser Ser Glu His He Val Glu Arg Glu Ala Glu Thr Ser Phe Ser 
320 325 330 

Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr 
20 335 340 345 

Gin Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu Ser He 
350 355 360 

Leu Ser Glu Ser His Ser Val He Val Met Ser Ser Val Glu Asn 
365 370 375 

25 Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn 

380 385 390 

Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala 
395 400 405 

Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg 
30 410 415 420 

Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro Val Asp 
425 430 435 

Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu Met Ser 
440 445 450 

35 Pro Pro Val Ser Ser Met Thr Val Ser Lys Pro Ser Met Ala Val 

455 460 465 

Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro 
470 475 480 
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Pro Arg Leu 
Ser Ser Phe 
5 Ala Ser Pro 
Gin Glu Tyr 
Ser Arg Arg 

10 

Arg Leu Glu 
Glu Ser Glu 
15 Leu Gly He 
Ala Phe Arg 
Ser Thr Gin 

20 

Asn Gin Asp 

<210> 6 
<211> 2490 
25 <212> DNA 

<213> Home sapiens 

<400> 6 

gtggctgcgg ggcaattgaa aaagagccgg cgaggagttc cccgaaactt 50 

gttggaactc cgggctcgcg cggaggccag gagctgagcg gcggcggctg 100 

30 ccggacgatg ggagcgtgag caggacggtg ataacctctc cccgatcggg 150 

ttgcgagggc gccgggcaga ggccaggacg cgagccgcca gcggcgggac 200 

ccatcgacga cttcccgggg cgacaggagc agccccgaga gccagggcga 250 

gcgcccgttc caggtggccg gaccgcccgc cgcgtccgcg ccgcgctccc 300 

tgcaggcaac gggagacgcc cccgcgcagc gcgagcgcct cagcgcggcc 350 

35 gctcgctctc cccatcgagg gacaaacttt tcccaaaccc gatccgagcc 400 

cttggaccaa actcgcctgc gccgagagcc gtccgcgtag agcgctccgt 450 
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Arg Glu Lys Lys Phe Asp His His Pro Gin Gin Phe 

485 490 495 

His His Asn Pro Ala His Asp Ser Asn Ser Leu Pro 

500 505 510 

Leu Arg He Val Glu Asp Glu Glu Tyr Glu Thr Thr 

515 520 525 

Glu Pro Ala Gin Glu Pro Val Lys Lys Leu Ala Asn 

530 535 540 

Ala Lys Arg Thr Lys Pro Asn Gly His He Ala Asn 

545 550 555 

Val Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn Ser 

560 565 570 

Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe 

575 580 585 

Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro 

590 595 600 

Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe 

605 610 615 

Glu Glu He Gin Ala Arg Leu Ser Ser Val He Ala 

620 625 630 

Pro He Ala Val 
635 637 
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ctccggcgag atg tec gag cgc aaa gaa ggc aga ggc aaa 490 
Met Ser Glu Arg Lys Glu Gly Arg Gly Lys 
1 5 10 

ggg aag ggc aag aag aag gag cga ggc tec ggc aag aag 529 
5 Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly Lys Lys 

15 20 

ccg gag tec gcg geg ggc age cag age eea gee ttg cct 568 
Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala Leu Pro 
25 30 . 35 

10 cce caa ttg aaa gag atg aaa age cag gaa teg get gca 607 
Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser Ala Ala 
40 45 

ggt tec aaa eta gtc ett egg tgt gaa acc agt tct gaa 646 
Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu 
15 50 55 60 

tac tee tet etc aga tte aag tgg ttc aag aat ggg aat 685 
Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 
65 70 75 

gaa ttg aat cga aaa aac aaa cca caa aat ate aag ata 724 
20 Glu ^eu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie 

80 85 

caa aaa aag eea ggg aag tea gaa ctt cge att aac aaa 763 
Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys 
90 95 100 

25 gca tea etg get gat tct gga gag tat atg tgc aaa gtg 802 
Ala Ser ^eu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val 
105 110 

ate age aaa tta gga aat gac agt gee tct gee aat ate 841 
lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn lie 
30 115 120 125 

acc ate gtg gaa tea aac gag ate ate act ggt atg cca 880 
Thr lie Val Glu Ser Asn Glu lie lie Thr Gly Met Pro 
130 135 140 

gee tea act gaa gga gca tat gtg tct tea gag tct ecc 919 
35 Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu Ser Pro 

145 150 

att aga ata tea gta tec aca gaa gga gca aat act tct 958 
lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr Ser 
155 160 165 

40 tea tct aea tct aca tec acc act ggg aca age cat ett 997 
Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu 
170 175 
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gta aaa 
Val Lys 
180 

gga ggg 
5 Gly Gly 



teg aga 
Ser Arg 



10 gat cgc 
Asp Arg 
220 

gcg gag 
Ala Glu 

15 

ggc ate 
Gly He 
245 

gtg gtg 
20 v'al Val 



ctg cat 
Leu His 



25 aac aat 
Asn Asn 
285 

aac cca 
Asn Pro 

30 

gta tct 
Val Ser 
310 

aga gaa 
35 Arg Glu 



tec aca 
Ser Thr 



40 age cac 
Ser His 
350 



tgt gcg gag 
Cys Ala Glu 



gag tgc ttc 
Glu Cys Phe 
195 

tac ttg tge 
Tyr Leu Cys 
210 

tge caa aac 
Cys Gin Asn 



gag ctg tac 
Glu Leu Tyr 
235 

tge ate gcc 
Cys lie Ala 



gcc tac tge 
Ala Tyr Cys 
260 

gae cgt ctt 
Asp Arg Leu 
275 

atg atg aac 
Met Met Asn 



ccc ccc gag 
Pro Pro Glu 
300 

aaa aac gtc 
Lys Asn Val 



gca gag aca 
Ala Glu Thr 
325 

gcc cat cac 
Ala His His 
340 

age tgg age 
Ser Trp Ser 



aag gag aaa 
Lys Glu Lys 
185 

atg gtg aaa 
Met Val Lys 
200 

aag tgc cca 
Lys Cys Pro 



tac gta atg 
Tyr Val Met 
225 

cag aag aga 
Gin Lys Arg 



etc ctt gtg 
Leu Leu Val 
250 

aaa acc aag 
Lys Thr Lys 
265 

egg cag age 
Arg Gin Ser 



att gcc aat 
He Ala Asn 
290 

aat gtc cag 
Asn Val Gin 



ate tec agt 
He Ser Ser 
315 

tec ttt tec 
Ser Phe Ser 
330 

tec act act 
Ser Thr Thr 



aac gga cac 
Asn Gly His 
355 



act ttc tgt 
Thr Phe Cys 
190 

gae ctt tea 
Asp Leu Ser 



aat gag ttt 
Asn Glu Phe 
215 

gee age ttc 
Ala Ser Phe 



gtg ctg acc 
Val Leu Thr 
240 

gtc ggc ate 
Val Gly He 
255 

aaa cag egg 
Lys Gin Arg 



ctt egg tct 
Leu Arg Ser 
280 

ggg cct cac 
Gly Pro His 



ctg gtg aat 
Leu Val Asn 
305 

gag cat att 
Glu His He 
320 

ace agt cac 
Thr Ser His 



gtc ace cag 
Val Thr Gin 
345 

act gaa age 
Thr Glu Ser 



gtg aat 1036 
Val Asn 



aac ccc 1075 
Asn Pro 
205 

act ggt 1114 
Thr Gly 



tac aag 1153 
Tyr Lys 
230 

ata acc 1192 
He Thr 



atg tgt 1231 
Met Cys 



aaa aag 1270 
Lys Lys 
270 

gaa ega 1309 
Glu Arg 



cat cct 1348 
His Pro 
295 

caa tac 1387 
Gin Tyr 



gtt gag 1426 
Val Glu 



tat act 1465 
Tyr Thr 
335 

act cct 1504 
Thr Pro 



ate ctt 1543 
He Leu 
360 
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tec gaa age cac tct gta ate gtg atg tea tee gta gaa 1582 
Ser Glu Ser His Ser Val lie Val Met Ser Ser Val Glu 
365 370 



aac agt 
5 Asn Ser 
375 

cgt ctt 
Arg Leu 



10 ttc etc 
Phe Leu 



gac tct 
Asp Ser 
15 415 

acc ccg 
Thr Pro 



agg cac age 
Arg His Ser 



aat ggc aca 
Asn Gly Thr 
390 

agg eat gee 
Arg His Ala 
405 

cct cat agt 
Pro His Ser 



get cgt atg 
Ala Arg Met 
430 



age cea act 
Ser Pro Thr 
380 

gga ggc cct 
Gly Gly Pro 
395 

aga gaa acc 
Arg Glu Thr 



gaa agg tat 
Glu Arg Tyr 
420 

tea cct gta 
Ser Pro Val 



ggg ggc cca 
Gly Gly Pro 
385 

cgt gaa tgt 
Arg Glu Cys 



cct gat tec 
Pro Asp Ser 
410 

gtg tea gee 
Val Ser Ala 



gat ttc cac 
Asp Phe His 
435 



aga gga 1621 
Arg Gly 



aac age 1660 
Asn Ser 
400 

tac cga 1699 
Tyr Arg 



atg ace 1738 
Met Thr 
425 

acg cca 1777 
Thr Pro 



age tec 
20 Ser Ser 
440 

gtg tec 
Val Ser 



25 age cec 
Ser Pro 



aca cca 
Thr Pro 
30 480 

cct cag 
Pro Gin 



gac agt 
35 Asp Ser 
505 

gag gat 
Glu Asp 



40 gee caa 
Ala Gin 



cec aaa teg 
Pro Lys Ser 



age atg acg 
Ser Met Thr 
455 

ttc atg gaa 
Phe Met Glu 
470 

cca agg ctg 
Pro Arg Leu 



cag ttc age 
Gin Phe Ser 
495 

aac age etc 
Asn Ser Leu 



gag gag tat 
Glu Glu Tyr 
520 

gag cct gtt 
Glu Pro Val 
535 



cec cct teg 
Pro Pro Ser 
445 

gtg tec aag 
Val Ser Lys 
460 

gaa gag aga 
Glu Glu Arg 



egg gag aag 
Arg Glu Lys 
485 

tec ttc cac 
Ser Phe His 



cct get age 
Pro Ala Ser 
510 

gaa acg acc 
Glu Thr Thr 
525 

aag aaa etc 
Lys Lys Leu 



gaa atg tct 
Glu Met Ser 
450 

cct tec atg 
Pro Ser Met 



cct eta ctt 
Pro Leu Leu 
475 

aag ttt gac 
Lys Phe Asp 



cac aac cec 
His Asn Pro 
500 

cec ttg agg 
Pro Leu Arg 
515 

caa gag tac 
Gin Glu Tyr 



gee aat age 
Ala Asn Ser 
540 



cca cec 1816 
Pr-o Pro 



gcg gtc 1855 
Ala Val 
465 

etc gtg 1894 
Leu Val 



cat cac 1933 
His His 
490 

gcg cat 1972 
Ala His 



ata gtg 2011 
lie Val 



gag cca 2050 
Glu Pro 
530 

egg egg 2089 
Arg Arg 
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gcc aaa aga acc aag ccc aat ggc cac att get aac aga 2128 
Ala Lys Arg Thr Lys Pro Asn Gly His lie Ala Asn Arg 
545 550 555 

ttg gaa gtg gac age aac aca age tec eag age agt aac 216 7 
5 Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn 
560 565 



tea gag agt gaa aea gaa gat gaa aga gta ggt gaa gat 2206 
Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp 
570 575 580 



10 aeg cct ttc ctg ggc ata cag aac ccc ctg gca gcc agt 2245 
Thr Pro Phe Leu Gly lie Gin Asn Pro Leu Ala Ala Ser 
585 590 '595 

ctt gag gca aca cct gcc ttc cgc ctg get gac age agg 2284 
Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg 
15 600 605 



act aac cca gca ggc cgc ttc teg aea cag gaa gaa ate 2323 
Thr Asn Pro Ala Gly Arg Phe Ser Thr Gin Glu Glu lie 
610 615 620 

cag gcc agg ctg tct agt gta att get aac caa gac cct 2362 
20 Gin Ala Arg Leu Ser Ser Val He Ala Asn Gin Asp Pro 

625 630 

att get gta taaaaceta aataaacaca tagattcacc tgtaaaaett 2410 
He Ala Val 
635 637 

25 tattttatat aataaagtat tccaccttaa attaaacaat ttattttatt 2460 



ttagcagttc tgcaaataaa aaaaaaaaaa 2490 

<210> 7 
<211> 241 
<212> PRT 
30 <213> Homo sapiens 



<400> 7 

Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 
1 5 10 15 

Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 
35 20 25 30 

Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 
35 40 45 

Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 
50 55 60 

40 Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 

65 70 75 



-20- 



wo 99/39729 



PCTAJS99/02390 



Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys 

80 85 90 

Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 

95 100 105 

5 Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu Gly Asn 

110 115 12 0 

Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn Glu lie 

125 130 135 

lie Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 

10 140 145 150 

Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr 

155 160 165 

Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 

170 175 180 

15 Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 

185 190 195 



Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys 
200 205 210 

Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val 
20 215 220 225 

Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro 
230 235 240 



Glu 
241 



25 <210> 8 

<211> 1715 
<212> DNA 

<213> Homo sapiens 



<400> 8 

30 gcgcctgcct ccaacctgcg ggcgggaggt gggtggctgc ggggcaattg 50 
aaaaagagcc ggcgaggagt tccccgaaac ttgttggaac tccgggctcg 100 
cgcggaggcc aggagctgag cggcggcggc tgccggacga tgggagcgtg 150 
agcaggacgg tgataacctc tccccgatcg ggttgcgagg gcgccgggca 200 
gaggccagga cgcgagccgc cagcggcggg acccatcgac gacttcccgg 250 

35 ggcgacagga gcagccccga gagccagggc gagcgcccgt tccaggtggc 300 
cggaccgccc gccgcgtccg cgccgcgctc cctgcaggca acgggagacg 350 
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cccccgcgca gcgcgagcgc ctcagcgcgg ccgctcgctc tccccatcga 400 

gggacaaact tttcccaaac ccgatccgag cccttggacc aaactcgcct 450 

gcgccgagag ccgtccgcgt agagcgctcc gtctccggcg ag atg 4 95 

Met 

5 1 

tec gag cgc aaa gaa ggc aga ggc aaa ggg aag ggc aag 534 
Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys 
5 10 

aag aag gag cga ggc tec ggc aag aag ccg gag tec gcg 573 
10 Lys Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala 
15 20 25 

gcg ggc age cag age cea gee ttg ect cec caa ttg aaa 612 
Ala Gly Ser Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys 
30 35 40 

15 gag atg aaa age cag gaa teg get gea ggt tee aaa eta 651 
Glu Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu 
45 50 

gtc ctt egg tgt gaa ace agt tct gaa tac tec tet etc 690 
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
20 55 60 65 

aga ttc aag tgg ttc aag aat ggg aat gaa ttg aat cga 729 
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg 
70 75 

aaa aac aaa cea caa aat ate aag ata caa aaa aag cea 768 
25 Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys Pro 
80 85 90 

ggg aag tea gaa ctt cgc att aac aaa gca tea ctg get 807 
Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 
95 100 105 

30 gat tct gga gag tat atg tgc aaa gtg ate age aaa tta 846 
Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu 
110 115 

gga aat gac agt gee tct gee aat ate ace ate gtg gaa 885 
Gly Asn Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu 
35 120 125 130 

tea aac gag ate ate act ggt atg cea gee tea act gaa 924 
Ser Asn Glu lie lie Thr Gly Met Pro Ala Ser Thr Glu 
135 140 

gga gca tat gtg tct tea gag tct cec att aga ata tea 963 
40 Gly Ala Tyr Val Ser Ser Glu Ser Pro lie Arg He Ser 
145 150 155 
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gta tec aca gaa gga gca aat act tct tea tct aca tct 1002 
Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 
160 165 170 

aca tec ace act ggg aca age cat ett gta aaa tgt gcg 1041 
5 Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala 

175 180 

gag aag gag aaa act ttc tgt gtg aat gga ggg gag tge 1080 
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
185 190 195 

10 ttc atg gtg aaa gae ett tea aac cec teg aga tac ttg 1119 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
200 205 

tge aag tge cca aat gag ttt act ggt gat cgc tge caa 1158 
Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin / 
15 210 215 220 

aac tac gta atg gee age ttc tac agt acg tec act ecc 1197 
Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro 
225 230 235 

ttt ctg tct ctg cet gaa tagga gcatgctcag ttggtgctgc 1240 
20 Phe Leu Ser Leu Pro Glu 

240 241 

tttettgttg etgcatctcc cetcagattc cacctagagc tagatgtgtc 1290 
ttaccagate taatattgae tgcetctgcc tgtegcatga gaacattaac 1340 
aaaagcaatt gtattactte etctgttcgc gactagttgg ctctgagata 13 90 

25 etaataggtg tgtgaggctc cggatgtttc tggaattgat attgaatgat 1440 
gtgatacaaa ttgatagtca atatcaagca gtgaaatatg ataataaagg 1490 
catttcaaag tctcactttt attgataaaa taaaaatcat tctactgaac 1540 
agtccatctt ctttatacaa tgaccacatc ctgaaaaggg tgttgctaag 1590 
ctgtaaccga tatgcacttg aaatgatggt aagttaattt tgattcagaa 1640 

30 tgtgttattt gtcacaaata aacataataa aaggagttca gatgtttttc 1690 

ttcattaacc aaaaaaaaaa aaaaa 1715 

<210> 9 
<211> 420 
<212> PRT 
35 <213> Homo sapiens 

<400> 9 

Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 
15 10 15 
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Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 

20 25 30 

Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 

35 40 45 

5 Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 

50 55 60 

Ser Glu Tyr Ser Ser Leu T^g Phe Lys Trp Phe Lys Asn Gly Asn 

€5 70 75 

Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys 

10 80 85 90 

Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 

95 100 105 

Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu Gly Asn 

110 115 120 

X5 Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn Glu lie 

125 130 135 

lie Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 

140 145 150 

Glu Ser Pro lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr 

20 155 160 165 

Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 

170 175 180 

Lys Cys Ala Glu Lys Glu Lys Tiir Phe Cys Val Asn Gly Gly Glu 

185 190 195 

25 Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys 

200 205 210 

Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val 

215 220 225 

Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu Tyr Gin Lys Arg Val 

30 230 235 240 

Leu Thr lie Thr Gly lie Cys lie Ala Leu Leu Val Val Gly lie 

245 250 255 

Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys 

260 265 270 

35 Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn 

275 280 285 

Met Met Asn lie Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 

290 295 300 
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Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn Val lie 
305 310 315 

Ser Ser Glu His lie Val Glu Arg Glu Ala Glu Thr Ser Phe Ser 
320 325 330 

5 Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr 

335 340 345 

Gin Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu Ser lie 
350 355 360 

Leu Ser Glu Ser His Ser Val He Val Met Ser Ser Val Glu Asn 
10 365 370 375 

Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn 
380 385 390 

Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala 
395 400 405 

15 Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg 

410 415 420 

<210> 10 
<211> 2431 
<212> DNA 
20 <213> Homo sapiens 

<400> 10 

gaggcgcctg cctccaacct gcgggcggga ggtgggtggc tgcggggcaa 50 
ttgaaaaaga gccggcgagg agttccccga aacttgttgg aactccgggc 100 
tcgcgcggag gccaggagct gagcggcggc ggctgccgga cgatgggagc 150 

25 gtgagcagga cggtgataac ctctccccga tcgggttgcg agggcgccgg 200 
gcagaggcca ggacgcgagc cgccagcggc gggacccatc gacgacttcc 250 
cggggcgaca ggagcagccc cgagagccag ggcgagcgcc cgttccaggt 300 
ggccggaccg cccgccgcgt ccgcgccgcg ctccctgcag gcaacgggag 350 
acgcccccgc gcagcgcgag cgcctcagcg cggccgctcg ctctccccat 400 

30 cgagggacaa acttttccca aacccgatcc gagcccttgg accaaactcg 450 

cctgcgccga gagccgtccg cgtagagcgc tccgtctccg gcgag atg 498 

Met 
1 

tec gag cgc aaa gaa ggc aga ggc aaa ggg aag ggc aag 537 
35 Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys 

5 10 
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aag aag gag cga ggc tec ggc aag aag ccg gag tec gcg 
Lys Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala 
15 20 25 

gcg ggc age cag age cca gee ttg cct ccc caa ttg aaa 
5 Ala Gly Ser Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys 
30 35 40 

gag atg aaa age cag gaa teg get gea ggt tec aaa eta 
Glu Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu 
45 50 

10 gtc ctt egg tgt gaa aec agt tct gaa tae tec tct etc 
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
55 60 65 

aga ttc aag tgg ttc aag aat ggg aat gaa ttg aat cga 
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg 
15 70 75 

aaa aac aaa cca caa aat ate aag ata caa aaa aag cca 
Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys Pro 
80 85 90 

ggg aag tea gaa ctt cgc att aac aaa gea tea etg get 
20 Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 
95 100 105 

gat tct gga gag tat atg tgc aaa gtg ate age aaa tta 
Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu 
110 115 

25 gga aat gae agt gee tct gee aat ate aec ate gtg gaa 
Gly Asn Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu 
120 125 130 

tea aac gag ate ate act ggt atg cea gee tea act gaa 
Ser Asn Glu lie lie Thr Gly Met Pro Ala Ser Thr Glu 
30 135 140 

gga gea tat gtg tct tea gag tct ccc att aga ata tea 
Gly Ala Tyr Val Ser Ser Glu Ser Pro lie Arg lie Ser 
145 150 155 

gta tec aca gaa gga gea aat act tct tea tct aca tct 
35 Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 
160 165 170 

aca tec ace act ggg aca age eat ctt gta aaa tgt gcg 
Thr Ser Thr Thr Gly Thr Ser His tieu Val Lys Cys Ala 
175 180 

40 gag aag gag aaa act ttc tgt gtg aat gga ggg gag tgc 
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
185 190 195 
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615 

654 

693 

732 

771 

810 

849 

888 

927 

966 

1005 

1044 

1083 
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ttc atg gtg aaa gac ctt tea aac ccc teg aga tac ttg 1122 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
200 205 

tgc aag tgt cca aat gag ttt act ggt gat ege tgc caa 1161 
5 Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin 
210 215 220 

aac tac gta atg gcc age ttc tac aag gcg gag gag ctg 1200 
Asn Tyr Val Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu 
225 230 235 

10 tac cag aag aga gtg ctg ace at a ace ggc ate tgc ate 123 9 
Tyr Gin Lys Arg Val Leu Thr lie Thr Gly lie Cys lie 
240 245 

gcc etc ctt gtg gtc ggc ate atg tgt gtg gtg gcc tac 1278 
Ala Leu Leu Val Val Gly lie Met Cys Val Val Ala Tyr 
15 250 255 260 

tgc aaa acc aag aaa cag egg aaa aag ctg cat gac cgt 1317 
Cys Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp Arg 
265 270 

ctt egg cag age ctt egg tet gaa ega aac aat atg atg 13 56 
20 Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn Met Met 
275 280 285 

aac att gcc aat ggg ect cac cat cet aac cca ccc ccc 1395 
Asn lie Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 
290 295 300 

25 gag aat gtc cag ctg gtg aat caa tac gta tet aaa aac 1434 
Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn 
305 310 

gtc ate tec agt gag cat att gtt gag aga gaa gca gag 1473 
Val lie Ser Ser Glu His lie Val Glu Arg Glu Ala Glu 
30 315 320 325 

aca tec ttt tec acc agt cac tat act tec aca gcc cat 1512 
Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His 
330 335 

cac tee act act gtc ace cag act ect age cac age tgg 1551 
35 His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser Trp 
340 345 350 

age aac gga cac act gaa age ate ctt tec gaa age cac 1590 
Ser Asn Gly His Thr Glu Ser lie I^eu Ser Glu Ser His 
355 360 365 

40 tet gta ate gtg atg tea tee gta gaa aac agt agg cac 162 9 
Ser Val lie Val Met Ser Ser Val Glu Asn Ser Arg His 
370 375 
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age age cca act ggg ggc cca aga gga cgt ctt aat ggc 1668 
Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly 
380 385 390 

aca gga ggc cct cgt gaa tgt aac age ttc etc agg cat 1707 
5 Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His 
395 400 

gee aga gaa ace cct gat tec tac cga gac tct cct cat 1746 
Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His 
405 410 415 

10 agt gaa agg taaaa cegaaggcaa agctactgca gaggagaaac 1790 
Ser Glu Arg 
420 

tcagtcagag aatccctgtg agcacctgcg gtctcacctc aggaaatcta 184 0 
ctctaateag aataaggggc ggcagttacc tgttctagga gtgctcctag 1890 

15 ttgatgaagt catctctttg tttgacggaa cttatttett ctgagcttct 194 0 
ctegtcgtcc cagtgactga caggcaacag actcttaaag agctgggatg 1990 
ctttgatgcg gaaggtgcag cacatggagt ttccagctct ggccatgggc 204 0 
teagacccae teggggtctc agtgtcctca gttgtaacat tagagagatg 2090 
geatcaatgc ttgataagga cccttetata attccaattg ccagttatcc 214 0 

20 aaactctgat tcggtggteg agctggcctc gtgttcttat ctgctaacec 2190 
tgtcttacct tccagcctca gttaagtcaa ateaaggget atgtcattgc 2240 
tgaatgtcat ggggggcaac tgcttgccct ecacectata gtatctattt 2290 
tatgaaattc caagaaggga tgaataaata aatctcttgg atgctgcgte 2340 
tggcagtctt cacgggtggt tttcaaagca gaaaaaaaaa aaaaaaaaaa 2390 

25 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 2431 

<210> 11 
<211> 768 
<212> PRT 

<213> Homo sapiens 
30 <400> 11 

Met Asp Val Lys Glu Arg Lys Pro Tyr Arg Ser Leu Thr Arg Arg 
1 5 10 15 

Arg Asp Ala Glu Arg Arg Tyr Thr Ser Ser Ser Ala Asp Ser Glu 
20 25 30 

35 Glu Gly Lys Ala Pro Gin Lys Ser Tyr Ser Ser Ser Glu Thr Leu 

35 40 45 
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Lys Ala Tyr Asp Gin Asp Ala Arg Leu Ala Tyr Gly Ser Arg Val 
50 55 60 

Lys Asp lie Val Pro Gin Glu Ala Glu Glu Phe Cys Arg Thr Gly 
65 70 75 

5 Ala Asn Phe Thr Leu Arg Glu Leu Gly Leu Glu Glu Val Thr Pro 

80 85 90 

Pro His Gly Thr Leu Tyr Arg Thr Asp lie Gly Leu Pro His Cys 
95 100 105 

Gly Tyr Ser Met Gly Ala Gly Ser Asp Ala Asp Met Glu Ala Asp 
10 110 115 120 

Thr Val Leu Ser Pro Glu His Pro Val Arg Leu Trp Gly Arg Ser 
125 130 135 

Thr Arg Ser Gly Arg Ser Ser Cys Leu Ser Ser Arg Ala Asn Ser 
140 145 150 

15 Asn Leu Thr Leu Thr Asp Thr Glu His Glu Asn Thr Glu Thr Asp 

155 160 165 

His Pro Gly Gly Leu Gin Asn His Ala Arg Leu Arg Thr Pro Pro 
170 175 180 

Pro Pro Leu Ser His Ala His Thr Pro Asn Gin His His Ala Ala 
20 185 190 195 

Ser lie Asn Ser Leu Asn Arg Gly Asn Phe Thr Pro Arg Ser Asn 
200 205 210 

Pro Ser Pro Ala Pro Thr Asp His Ser Leu Ser Gly Glu Pro Pro 
215 220 225 

25 Ala Gly Gly Ala Gin Glu Pro Ala His Ala Gin Glu Asn Trp Leu 

230 235 240 

Leu Asn Ser Asn He Pro Leu Glu Thr Arg Asn Leu Gly Lys Gin 
245 250 255 

Pro Phe Leu Gly Thr Leu Gin Asp Asn Leu He Glu Met Asp He 
30 260 265 270 

Leu Gly Ala Ser Arg His Asp Gly Ala Tyr Ser Asp Gly His Phe 
275 280 285 

Leu Phe Lys Pro Gly Gly Thr Ser Pro Leu Phe Cys Thr Thr Ser 
290 295 300 

35 Pro Gly Tyr Pro Leu Thr Ser Ser Thr Val Tyr Ser Pro Pro Pro 

305 310 315 

Arg Pro Leu Pro Arg Ser Thr Phe Ala Arg Pro Ala Phe Asn Leu 
320 325 330 
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Lys Lys Pro Ser Lys Tyr Cys Asn Trp Lys Cys Ala Ala Leu Ser 
335 340 345 

Ala lie Val lie Ser Ala Thr Leu Val lie Leu Leu Ala Tyr Phe 
350 355 360 

5 Val Ala Met His Leu Phe Gly Leu Asn Trp His Leu Gin Pro Met 

365 370 375 

Glu Gly Gin Met Tyr Glu lie Thr Glu Asp Thr Ala Ser Ser Trp 
380 385 390 

Pro Val Pro Thr Asp Val Ser Leu Tyr Pro Ser Gly Gly Thr Gly 
10 395 400 405, 

Leu Glu Thr Pro Asp Arg Lys Gly Lys Gly Thr Thr Glu Gly Lys 
410 415 420 

Pro Ser Ser Phe Phe Pro Glu Asp Ser Phe lie Asp Ser Gly Glu 
425 430 435 

15 lie Asp Val Gly Arg Arg Ala Ser Gin Lys lie Pro Pro Gly Thr 

440 445 450 

Phe Trp Arg Ser Gin Val Phe lie Asp His Pro Val His Leu Lys 
455 460 465 

Phe Asn Val Ser Leu Gly Lys Ala TQa Leu Val Gly lie Tyr Gly 
20 470 475 480 

Arg Lys Gly Leu Pro Pro Ser His Thr Gin Phe Asp Phe Val Glu 
485 490 495 

Leu l.eu Asp Gly Arg Arg Leu Leu Thr Gin Glu Ala Arg Ser Leu 
500 505 510 

25 Glu Gly Thr Pro Arg Gin Ser Arg Gly Thr Val Pro Pro Ser Ser 

515 520 525 

His Glu Thr Gly Phe lie Gin Tyr Leu Asp Ser Gly lie Trp His 
530 535 540 

Leu Ala Phe Tyr Asn Asp Gly Lys Glu Ser Glu Val Val Ser Phe 
30 545 550 555 

Leu Thr Thr Ala lie Ala Leu Pro Pro Arg Leu Lys Glu Met Lys 
560 565 570 

Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu 
575 580 585 

35 Thr Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn 

590 595 600 

Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie 
605 610 615 
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Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser 
620 625 630 

Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu 
635 640 645 

5 Gly Asn Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn 

650 655 660 

Glu lie He Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val 
665 670 675 

Ser Ser Glu Ser Pro He Arg lie Ser Val Ser Thr Glu Gly Ala 
10 680 685 690 

Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His 
695 700 70S 

Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly 
710 715 720 

15 Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 

725 730 735 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn 
740 745 750 

Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser 
20 755 760 765 

Leu Pro Glu 
768 

<2X0> 12 
<211> 3111 
25 <212> DNA 

<213> Homo sapiens 

<400> 12 

gggtaccatg ggtcggtgag cgcgtttccc gcctgagcgc aactagcggc 50 

gggtcgtggg cacctccaga aaagatcccg caccatcctc caggatccaa 100 

30 tggccttgga gagagggctg cagggcccac ggacattgct gactcttcag 150 

aacgtgctga catggagcca ggtagactga aattatcatg tgtccaaatt 200 

aaaattgcat acttcaagga ttatttgaag gactattctt agaccctttt 250 

aagaagattt aaagaaaaac cactcggccc tgagtgcggc gaggaccctg 300 

tttgtggatg tggaggagcg cgggccggag gcc atg gac gtg 342 
35 Met Asp Val 

1 

aag gag agg aag cct tac cgc teg ctg acc egg cgc cgc 381 
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Lys Glu Arg Lys Pro Tyr Arg Ser Leu Thr Arg Arg Arg 
5 10 15 

gac gcc gag cgc cgc tac acc age teg tec gcg gac age 420 
Asp Ala Glu Arg Arg Tyr Thr Ser Ser Ser Ala Asp Ser 
5 20 25 



gag gag ggc aaa gcc ccg cag aaa teg tac age tec age 459 

Glu Glu Gly Lys Ala Pro Gin Lys Ser Tyr Ser Ser Ser 

30 35 40 

gag acc ctg aag gcc tac gac cag gac gcc cgc eta gcc 498 

10 Glu Thr Leu Lys Ala Tyr Asp Gin Asp Ala Arg Leu Ala 

45 50 55 



tat ggc age cgc gtc aag gac att gtg ccg cag gag gcc 537 
Tyr Gly Ser Arg Val Lys Asp lie Val Pro Gin Glu Ala 
60 65 

15 gag gaa ttc tgc cgc aca ggt gcc aac ttc acc ctg egg 576 
Glu Glu Phe Cys Arg Thr Gly Ala Asn Phe Thr Leu Arg 
70 75 80 



gag ctg ggg ctg gaa gaa gta acg ccc cct cac ggg acc 615 
Glu Leu Gly Leu Glu Glu Val Thr Pro Pro His Gly Thr 
20 85 90 

Ctg tac egg aca gac att ggc etc ccc cac tgc ggc tac 654 
Leu Tyr Arg Thr Asp lie Gly Leu Pro His Cys Gly Tyr 
95 100 105 



tec atg ggg get ggc tet gat gcc gac atg gag get gac 693 
25 Ser Met Gly Ala Gly Ser Asp Ala Asp Met Glu Ala Asp 
110 115 120 

acg gtg ctg tec cct gag cac ccc gtg cgt ctg tgg ggc 732 
Thr Val Leu Ser Pro Glu His Pro Val Arg Leu Trp Gly 
125 130 

30 egg age aca egg tea ggg cgc age tec tgc ctg tec age 771 
Arg Ser Thr Arg Ser Gly Arg Ser Ser Cys Leu Ser Ser 
135 140 145 

egg gcc aat tec aat etc aca etc acc gac ace gag cat 810 
Arg Ala Asn Ser Asn Leu Thr Leu Thr Asp Thr Glu His 
35 150 155 

gaa aac act gag act gat cat ccg ggc ggc ctg cag aac 849 
Glu Asn Thr Glu Thr Asp His Pro Gly Gly Leu Gin Asn 
160 165 170 

cac gcg egg etc egg acg ccg ccg ccg ccg etc teg cac 888 
40 His Ala Arg Leu Arg Thr Pro Pro Pro Pro Leu Ser His 
175 180 185 



gcc cac acc ccc aac cag cac cac gcg gee tec att aac 927 
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Ala His Thr Pro Asn Gin His His Ala Ala Ser lie Asn 
190 195 

tec ctg aac egg ggc aac ttc acg ecg agg age aac ccc 966 
Ser I^eu Asn Arg Gly Asn Phe Thr Pro Arg Ser Asn Pro 
5 200 205 210 

age ccg gee eec acg gac cac teg etc tec gga gag cce 1005 
Ser Pro Ala Pro Thr Asp His Ser Leu Ser Gly Glu Pro 
215 220 

cct gee gge gge gee cag gag cet gee cac gee cag gag 1044 
10 Pro Ala Gly Gly Ala Gin Glu Pro Ala His Ala Gin Glu 
225 230 235 

aac tgg ctg etc aac age aac ate ccc ctg gag ace aga 1083 
Asn Trp Leu Leu Asn Ser Asn lie Pro Leu Glu Thr Arg 
240 245 250 

15 aac eta ggc aag cag cca ttc eta ggg aca ttg cag gac 1122 
Asn Leu Gly Lys Gin Pro Phe Leu Gly Thr Leu Gin Asp 
255 260 

aac etc att gag atg gac att etc ggc gee tec cgc cat 1161 
Asn Leu lie Glu Met Asp lie Leu Gly Ala Ser Arg His 
20 265 270 275 

gat ggg get tac agt gac ggg cac ttc etc ttc aag cct 1200 
Asp Gly Ala Tyr Ser Asp Gly His Phe Leu Phe Lys Pro 
280 285 

gga ggc ace tec ccg etc ttc tgc ace aca tea cca ggg 1239 
25 Gly Gly Thr Ser Pro Leu Phe Cys Thr Thr Ser Pro Gly 
290 295 300 

tac cca ctg acg tee age aca gtg tac tet cct ccg ccc 1278 
Tyr Pro Leu Thr Ser Ser Thr Val Tyr Ser Pro Pro Pro 
305 310 315 

30 cga cce ctg ccc cgc age acc ttc gcc egg ccg gee ttt 1317 
Arg Pro Leu Pro Arg Ser Thr Phe Ala Arg Pro Ala Phe 
320 325 

aae etc aag aag ccc tec aag tac tgt aac tgg aag tgc 1356 
Asn Leu Lys Lys Pro Ser Lys Tyr Cys Asn Trp Lys Cys 
35 330 335 340 

gca gcc ctg age gee ate gtc ate tea gcc act ctg gtc 1395 
Ala Ala Leu Ser Ala lie Val lie Ser Ala Thr Leu Val 
345 350 

ate ctg ctg gca tac ttt gtg gee atg cac ctg ttt ggc 1434 
40 He Leu Leu Ala Tyr Phe Val Ala Met His Leu Phe Gly 
355 360 365 

eta aac tgg cac ctg cag ccg atg gag ggg cag atg tat 1473 
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Leu Asn 



gag ate 
Glu lie 

5 

acc gac 
Thr Asp 
395 

gag acc 
10 Glu Thr 



aag ccc 
Lys Pro 
420 

15 tct gga 
Ser Gly 

att cct 
lie Pro 

20 

gac cat 
Asp His 
460 

aag gca 
25 Lys Ala 



cct cct 
Pro Pro 
485 

30 gat ggc 
Asp Gly 



gag ggg 
Glu Gly 

35 

tec age 
Ser Ser 
525 

gga ate 
40 Gly lie 



tea gaa 



Trp His Leu 
370 

aeg gag gac 
Thr Glu Asp 
385 

gtc tec eta 
Val Ser Leu 



cct gac agg 
Pro Asp Arg 
410 

agt agt ttc 
Ser Ser Phe 



gaa att gat 
Glu lie Asp 
435 

cct ggc act 
Pro Gly Thr 
450 

cct gtg cat 
Pro Val His 



gee ctg gtt 
Ala Leu Val 
475 

tea eat aca 
Ser His Thr 



agg agg etc 
Arg Arg Leu 
500 

acc ccg cgc 
Thr Pro Arg 
515 

cat gag aca 
His Glu Thr 



tgg cae ttg 
Trp His Leu 
540 

gtg gtt tec 



Gin Pro Met 
375 

aca gee age 
Thr Ala Ser 



tac ccc tea 
Tyr Pro Ser 
400 

aaa ggc aaa 
Lys Gly Lys 



ttt cca gag 
Phe Pro Glu 
425 

gtg gga agg 
Val Gly Arg 
440 

ttc tgg aga 
Phe Trp Arg 



ctg aaa ttc 
Leu Lys Phe 
465 

ggc att tat 
Gly He Tyr 



cag ttt gac 
Gin Phe Asp 
490 

eta acc cag 
Leu Thr Gin 
505 

cag tct egg 
Gin Ser Arg 



ggc ttc ate 
Gly Phe He 
530 

get ttt tac 
Ala Phe Tyr 



ttt etc ace 



Glu Gly Gin 



agt tgg cct 
Ser Trp Pro 
390 

ggg ggc act 

Gly Gly Thr 



gga ace aca 
Gly Thr Thr 
415 

gac agt ttc 
Asp Ser Phe 
430 

cga get tec 
Arg Ala Ser 



tct caa gtg 
Ser Gin Val 
455 

aat gtg tct 
Asn Val Ser 



ggc aga aaa 
Gly Arg Lys 
480 

ttt gtg gag 
Phe Va?x Glu 
495 

gag gcg egg 
Glu Ala Arg 



gga act gtg 
Gly Thr Val 
520 

cag tat ttg 
Gin Tyr Leu 



aat gac gga 
Asn Asp Gly 
545 

act gee att 



Met Tyr 
380 

gtg cca 1512 
Val Pro 



ggc tta 1551 
Gly Leu 
405 

gaa gga 1590 
Glu Gly 



ata gat 1629 
He Asp 



cag aag 1668 
Gin Lys 
445 

ttc ata 1707 
Phe He 



ctg gga 1746 
Leu Gly 
470 

ggc etc 1785 
Gly Leu 



ctg ctg 1824 
Leu lieu 



age eta 1863 
Ser Leu 
510 

ccc ccc 1902 
Pro Pro 



gat tea 1941 
Asp Ser 
535 

aag gag 1980 
Lys Glu 



gee ttg 2019 
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Ser Glu Val Val Ser Phe Leu Thr Thr Ala lie Ala Leu 
550 555 560 

cct ccc cga ttg aaa gag atg aaa age cag gaa teg get 2 058 
Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu Ser Ala 
5 565 570 575 

gca ggt tec aaa eta gtc ctt egg tgt gaa aec agt tct 2097 
Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser 
580 585 

gaa tac tec tct cte aga ttc aag tgg ttc aag aat ggg 2136 
10 Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly 
590 595 600 

aat gaa ttg aat cga aaa aac aaa cea caa aat ate aag 2175 
Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys 
605 610 

15 ata caa aaa aag cea ggg aag tea gaa ctt egc att aac 2214 
lie Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg lie Asn 
615 620 625 

aaa gca tea ctg get gat tct gga gag tat atg tgc aaa 2253 
Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys 
20 630- 635 640 

gtg ate age aaa tta gga aat gae agt gee tet gee aat 2292 
Val lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn 
645 650 

ate ace ate gtg gaa tea aac gag ate ate act ggt atg 2331 
25 lie Thr He Val Glu Ser Asn Glu lie He Thr Gly Met 
655 660 665 

cea gee tea act gaa gga gca tat gtgr tct tea gag tet 2370 
Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu Ser 
670 675 

30 ccc att aga ata tea gta tec aca gaa gga gca aat act 2409 
Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr 
680 685 690 

tct tea tct aca tct aca tec acc act ggg aca age cat 2448 
Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His 
35 695 700 705 

ctt gta aaa tgt geg gag aag gag aaa act ttc tgt gtg 2487 
Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val 
710 715 

aat gga ggg gag tgc ttc atg gtg aaa gac ctt tea aac 2526 
40 Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn 
720 725 730 

ccc teg aga tac ttg tgc aag tgc cea aat gag ttt act 2565 
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Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe Thr 
735 740 

ggt gat cgc tgc caa aac tac gta atg gcc age ttc tac 2604 
Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr 
5 745 750 755 

agt acg tec act ccc ttt ctg tct ctg cct gaa tag 264 0 
Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
760 765 768 

gagcatgctc agttggtgct gctttcttgt tgctgcatct cccctcagat 2690 

10 tccacctaga gctagatgtg tcttaccaga tctaatattg actgcctctg 2740 

cctgtcgcat gagaacatta acaaaagcaa ttgtattact tcctctgttc 2790 

gcgactagtt ggctctgaga tactaatagg tgtgtgaggc tccggatgtt 2 840 

tctggaattg atattgaatg atgtgataca aattgatagt caatatcaag 2890 

cagtgaaata tgataataaa ggcatttcaa agtctcactt ttattgataa 2940 

15 aataaaaatc attctactga acagtccatc ttctttatac aatgaccaca 2 990 

tcctgaaaag ggtgttgcta agctgtaacc gatatgcact tgaaatgatg 304 0 

gtaagttaat tttgattcag aatgtgttat ttgtcacaaa taaacataat 3 090 

aaaaggaaaa aaaaaaaaaa a 3111 

<210> 13 
20 <211> 1872 
<212> DNA 
<213> Homo sapiens 



<400> 13 





gaattcggga 


cagcctctcc 


tgccgccgct 


gctgctgccg 


ccgccgccac 


SO 


25 


cgccggctgg 


tcctccttct 


gcttttactt 


ctcctgcatg 


acagttgttt 


100 




tcttcatctg 


agcagacacc 


agcttcagat 


gctcgaggtg 


agaaacatgc 


150 




ctttcagttt 


gggctactgg 


tttacttaat 


taatcagccg 


gcagctccgt 


200 




cgatctattt 


tcgtccctgt 


octet tgacg 


agcccgggat 


ggtttggagt 


250 




agcatttaaa 


agaactagaa 


aagtggccca 


gaaacagcag 


cttaaagaat 


300 


30 


tattacgata 


tactttgatt 


ttgtagttgc 


taggagcttt 


tcttcccccc 


350 




ttgcatcttt 


ctgaactctt 


cttgatttta 


ataatggcct 


tggacttgga 


400 




cgatttatcg 


atttccccct 


gtaagatgct 


gtatcatttg 


gttggggggg 


450 




cctctgcgtg gtaatggacc 


gtgagagcgg 


ccaggccttc 


ttctggaggt 


500 
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gagccg atg gag att tat tec cca gac atg tct gag gtc 539 
Met Glu lie Tyr Ser Pro Asp Met Ser Glu Val 
15 10 

gcc gcc gag agg tec tec age cec tee act cag ctg agt 578 
5 Ala Ala Glu Arg Ser Ser Ser Pro Ser Thr Gin Leu Ser 

15 20 

gea gac cca tct ett gat ggg ett ecg gca gea gaa gac 617 
Ala Asp Pro Ser Leu Asp Gly Leu Pro Ala Ala Glu Asp 
25 30 35 

10 atg cca gag cec cag act gaa gat ggg aga ace ect gga 656 
Met Pro Glu Pro Gin Thr Glu Asp Gly Arg Thr Pro Gly 
40 45 50 

etc gtg gge ctg gcc gtg cec tgc tgt gcg tgc eta gaa 695 
Leu Val Gly Leu Ala Val Pro Cys Cys Ala Cys Leu Glu 
15 55 60 

get gag cgc ctg aga ggt tgc etc aac tea gag aaa ate 734 
Ala Glu Arg Leu Arg Gly Cys Leu Asn Ser Glu Lys lie 
65 70 75 

tgc att gtc cec ate ctg get tgc ctg gtc age etc tgc 773 
2 0 Cys lie Val Pro He Leu Ala Cys Leu Val Ser Leu Cys 

80 85 

etc tgc ate gcc gge etc aag tgg gta ttt gtg gac aag 812 
Leu Cys He Ala Gly Leu Lys Trp Val Phe Val Asp Lys 
90 95 100 

25 ate ttt gaa tat gac tct cet act cac ett gac cct ggg 851 
lie Phe Glu Tyr Asp Ser Pro Thr His Leu Asp Pro Gly 
105 110 115 

9S9 tta gge cag gac cct att att tct ctg gac gca act 890 
Gly Leu Gly Gin Asp Pro He He Ser Leu Asp Ala Thr 
30 120 125 

get gcc tea get gtg tgg gtg teg tct gag gca tac act 929 
Ala Ala Ser Ala Val Trp Val Ser Ser Glu Ala Tyr Thr 
130 135 140 

tea cct gtc tct agg get caa tct gaa agt gag gtt caa 968 
35 Ser Pro Val Ser Arg Ala Gin Ser Glu Ser Glu Val Gin 

145 150 

gtt aca gtg caa ggt gac aag get gtt gtc tec ttt gaa 1007 
Val Thr Val Gin Gly Asp Lys Ala Val Val Ser Phe Glu 
155 160 165 

40 cca tea gcg gca ccg aca ceg aag aat cgt att ttt gcc 1046 
Pro Ser Ala Ala Pro Thr Pro Lys Asn Arg He Phe Ala 
170 175 180 
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ttt tct ttc ttg ccg tec act gcg cca tec ttc cct tea 1085 
Phe Ser Phe Leu Pro Ser Thr Ala Pro Ser Phe Pro Ser 
185 190 

cce acc egg aae cct gag gtg aga acg ccc aag tea gea 1124 
5 Pro Thr Arg Asn Pro Glu Val Arg Thr Pro Lys Ser Ala 
195 200 205 

act cag cca caa aca aea gaa act aat etc caa act get 1163 
Thr- Gin Pro Gin Thr Thr Glu Thr Asn Leu Gin Thr Ala 
210 . 215 

10 cct aaa ctt tct aca tct aca tec acc act ggg aca age 1202 
Pro Lys Leu Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser 
220 225 230 

cat ctt gta aaa tgt gcg gag aag gag aaa act ttc tgt 1241 
His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys 
15 235 240 245 

gtg aat gga ggg gag tgc ttc atg gtg aaa gac ctt tea 1280 
Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser 
250 255 

aac ccc teg aga tac ttg tgc aag tgc cca aat gag ttt 1319 
20 Asn Pro Ser Aig Tyr Leu Cys Lys Cys Pro Asn Glu Phe 
260 265 270 

act ggt gat egc tgc caa aac tac gta atg gee age ttc 1358 
Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe 
275 280 

25 tac agt acg tec act ccc ttt ctg tct ctg cct gaa taggag 1400 
Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
285 290 295 296 

catgctcagt tggtgctgct t tct tgt tgc tgcatctccc ctcagattcc 1450 

acetagagct agatgtgtct taccagatct aatattgact gcetetgcet 1500 

30 gtcgcatgag aacattaaca aaagcaattg tattacttcc tctgttcgeg 1550 

actagttggc tctgagatac taataggtgt gtgaggctcc ggatgtttct 1600 

ggaattgata ttgaatgatg tgatacaaat tgatagtcaa tatcaagcag 1650 

tgaaatatga taataaagge atttcaaagt etcactttta ttgataaaat 1700 

aaaaatcatt ctactgaaca gtccatcttc tttatacaat gaccacatcc 1750 

35 tgaaaagggt gttgctaage tgtaaccgat atgeacttga aatgatggta 1800 

agttaatttt gattcagaat gtgttatttg tcacaaataa acataataaa 1850 

aggaaaaaaa aaacccgaat te 1872 
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<210> 14 
<211> 296 
<212> PRT 

<213> Homo sapiens 
5 <400> 14 

Met Glu lie Tyr Ser Pro Asp Met Ser Glu Val Ala Ala Glu Arg 
1 5 10 15 

Ser Ser Ser Pro Ser Thr Gin Leu Ser Ala Asp Pro Ser Leu Asp 
20 25 30 

10 Gly Leu Pro Ala Ala Glu Asp Met Pro Glu Pro Gin Thr Glu Asp 

35 40 45 

Gly Arg Thr Pro Gly Leu Val Gly Leu Ala Val Pro Cys Cys Ala 
50 55 60 

Cys Leu Glu Ala Glu Arg Leu Arg Gly Cys Leu Asn Ser Glu Lys 
15 65 70 75 

lie Cys lie Val Pro lie Leu Ala Cys Leu Val Ser Leu Cys Leu 
80 85 90 

Cys lie Ala Gly Leu Lys Trp Val Phe Val Asp Lys lie Phe Glu 
95 lOO 105 

20 Tyr Asp Ser Pro Thr His Leu Asp Pro Gly Gly Leu Gly Gin Asp 

110 115 120 

Pro lie lie Ser Leu Asp Ala Thr Ala Ala Ser Ala Val Trp Val 
125 130 135 

Ser Ser Glu Ala Tyr Thr Ser Pro Val Ser Arg TLLa Gin Ser Glu 
25 140 145 150 

Ser Glv Val Gin Val Thr Val Gin Gly Asp Lys Ala Val Val Ser 
155 160 165 

Phe Glu Pro Ser Ala Ala Pro Thr Pro Lys Asn Arg lie Phe Ala 
170 175 180 

30 Phe Ser Phe Leu Pro Ser Thr Ala Pro Ser Phe Pro Ser Pro Thr 

185 190 195 

Arg Asn Pro Glu Val Arg Thr Pro Lys Ser Ala Thr Gin Pro Gin 
200 205 210 

Thr Thr Glu Thr Asn Leu Gin Thr Ala Pro Lys Leu Ser Thr Ser 
35 215 220 225 

Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu Lys 
230 235 240 

Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys 
245 250 255 
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Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu 
260 265 270 

Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr 
275 280 285 

Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
290 295 296 
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